Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 85855 |
| Missing cells | 101139 |
| Missing cells (%) | 5.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 14.4 MiB |
| Average record size in memory | 176.0 B |
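The missing-cell percentage above can be checked by hand. The sketch below assumes the denominator is simply variables × observations, which is consistent with the reported 5.4% but is not taken from the tool's source code.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 22
n_observations = 85855
missing_cells = 101139

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_share = missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_share:.1%}")
```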
Variable types
| Categorical | 13 |
|---|---|
| Numeric | 9 |
Alerts
| Alert | Type |
|---|---|
| imdb_title_id has a high cardinality: 85855 distinct values | High cardinality |
| title has a high cardinality: 82094 distinct values | High cardinality |
| original_title has a high cardinality: 80852 distinct values | High cardinality |
| year has a high cardinality: 113 distinct values | High cardinality |
| date_published has a high cardinality: 22012 distinct values | High cardinality |
| genre has a high cardinality: 1257 distinct values | High cardinality |
| country has a high cardinality: 4907 distinct values | High cardinality |
| language has a high cardinality: 4377 distinct values | High cardinality |
| director has a high cardinality: 34733 distinct values | High cardinality |
| writer has a high cardinality: 66859 distinct values | High cardinality |
| production_company has a high cardinality: 32050 distinct values | High cardinality |
| actors has a high cardinality: 85729 distinct values | High cardinality |
| description has a high cardinality: 83611 distinct values | High cardinality |
| avg_vote is highly correlated with metascore | High correlation |
| votes is highly correlated with usa_gross_income and 2 other fields | High correlation |
| usa_gross_income is highly correlated with votes and 3 other fields | High correlation |
| worlwide_gross_income is highly correlated with usa_gross_income and 2 other fields | High correlation |
| metascore is highly correlated with avg_vote | High correlation |
| reviews_from_users is highly correlated with votes and 3 other fields | High correlation |
| reviews_from_critics is highly correlated with votes and 3 other fields | High correlation |
| writer has 1572 (1.8%) missing values | Missing |
| production_company has 4455 (5.2%) missing values | Missing |
| description has 2115 (2.5%) missing values | Missing |
| metascore has 72550 (84.5%) missing values | Missing |
| reviews_from_users has 7597 (8.8%) missing values | Missing |
| reviews_from_critics has 11797 (13.7%) missing values | Missing |
| budget is highly skewed (γ1 = 176.1867109) | Skewed |
| imdb_title_id is uniformly distributed | Uniform |
| title is uniformly distributed | Uniform |
| original_title is uniformly distributed | Uniform |
| writer is uniformly distributed | Uniform |
| actors is uniformly distributed | Uniform |
| description is uniformly distributed | Uniform |
| imdb_title_id has unique values | Unique |
| budget has 62179 (72.4%) zeros | Zeros |
| usa_gross_income has 70529 (82.1%) zeros | Zeros |
| worlwide_gross_income has 54839 (63.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-04 13:34:33.347704 |
|---|---|
| Analysis finished | 2022-10-04 13:35:30.392595 |
| Duration | 57.04 seconds |
| Software version | pandas-profiling v3.3.1 |
| Download configuration | config.json |
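The reported duration follows directly from the two timestamps in the Reproduction table. A minimal sketch, assuming both timestamps share the same timezone:

```python
from datetime import datetime

# Timestamp layout used in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-10-04 13:34:33.347704", FMT)
finished = datetime.strptime("2022-10-04 13:35:30.392595", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
elapsed_seconds = (finished - started).total_seconds()
print(f"{elapsed_seconds:.2f} seconds")
```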
imdb_title_id
Categorical
| Distinct | 85855 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| tt0000009 | 1 |
|---|---|
| tt1347008 | 1 |
| tt1347006 | 1 |
| tt1346973 | 1 |
| tt1346961 | 1 |
| Other values (85850) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.011659193 |
| Min length | 9 |
Characters and Unicode
| Total characters | 773696 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
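The category counts in this report can be approximated with Python's standard library. The sketch below is not the profiler's own implementation; it tallies Unicode general categories over the five sample IDs shown in this section. `unicodedata.category` returns two-letter codes, where `Ll` corresponds to "Lowercase Letter" and `Nd` to "Decimal Number". The helper name `category_counts` is hypothetical, and script and block lookups are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# The five sample values listed for imdb_title_id in this report.
sample_ids = ["tt0000009", "tt0000574", "tt0001892", "tt0002101", "tt0002130"]


def category_counts(values):
    """Count characters per Unicode general category (hypothetical helper)."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(sample_ids)
total = sum(counts.values())
for code, n in counts.most_common():
    print(code, n, f"{n / total:.1%}")
```

On the full column the same tally should yield two categories, in line with the "Distinct categories" row above.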
Unique
| Unique | 85855 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0000009 |
|---|---|
| 2nd row | tt0000574 |
| 3rd row | tt0001892 |
| 4th row | tt0002101 |
| 5th row | tt0002130 |
Common Values
| Value | Count | Frequency (%) |
| tt0000009 | 1 | < 0.1% |
| tt1347008 | 1 | < 0.1% |
| tt1347006 | 1 | < 0.1% |
| tt1346973 | 1 | < 0.1% |
| tt1346961 | 1 | < 0.1% |
| tt1346850 | 1 | < 0.1% |
| tt1346629 | 1 | < 0.1% |
| tt1346302 | 1 | < 0.1% |
| tt1346281 | 1 | < 0.1% |
| tt1345904 | 1 | < 0.1% |
| Other values (85845) | 85845 |
Words
| Value | Count | Frequency (%) |
| tt0000009 | 1 | < 0.1% |
| tt0003637 | 1 | < 0.1% |
| tt0001892 | 1 | < 0.1% |
| tt0002101 | 1 | < 0.1% |
| tt0002130 | 1 | < 0.1% |
| tt0002199 | 1 | < 0.1% |
| tt0002423 | 1 | < 0.1% |
| tt0002445 | 1 | < 0.1% |
| tt0002452 | 1 | < 0.1% |
| tt0002461 | 1 | < 0.1% |
| Other values (85845) | 85845 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 171710 | |
| 0 | 126521 | |
| 1 | 66918 | 8.6% |
| 2 | 59104 | 7.6% |
| 4 | 55377 | 7.2% |
| 3 | 52321 | 6.8% |
| 6 | 51167 | 6.6% |
| 8 | 50723 | 6.6% |
| 5 | 47360 | 6.1% |
| 7 | 46862 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 601986 | |
| Lowercase Letter | 171710 | 22.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 126521 | |
| 1 | 66918 | |
| 2 | 59104 | |
| 4 | 55377 | |
| 3 | 52321 | |
| 6 | 51167 | |
| 8 | 50723 | |
| 5 | 47360 | 7.9% |
| 7 | 46862 | 7.8% |
| 9 | 45633 | 7.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 171710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 601986 | |
| Latin | 171710 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 126521 | |
| 1 | 66918 | |
| 2 | 59104 | |
| 4 | 55377 | |
| 3 | 52321 | |
| 6 | 51167 | |
| 8 | 50723 | |
| 5 | 47360 | 7.9% |
| 7 | 46862 | 7.8% |
| 9 | 45633 | 7.6% |
Latin
| Value | Count | Frequency (%) |
| t | 171710 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 773696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 171710 | |
| 0 | 126521 | |
| 1 | 66918 | 8.6% |
| 2 | 59104 | 7.6% |
| 4 | 55377 | 7.2% |
| 3 | 52321 | 6.8% |
| 6 | 51167 | 6.6% |
| 8 | 50723 | 6.6% |
| 5 | 47360 | 6.1% |
| 7 | 46862 | 6.1% |
title
Categorical
| Distinct | 82094 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| Anna | 10 |
|---|---|
| Darling | 8 |
| Wanted | 7 |
| Vendetta | 7 |
| Lucky | 7 |
| Other values (82089) |
Length
| Max length | 196 |
|---|---|
| Median length | 84 |
| Mean length | 16.9734203 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1457253 |
|---|---|
| Distinct characters | 153 |
| Distinct categories | 16 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 79153 |
|---|---|
| Unique (%) | 92.2% |
Sample
| 1st row | Miss Jerry |
|---|---|
| 2nd row | The Story of the Kelly Gang |
| 3rd row | Den sorte drøm |
| 4th row | Cleopatra |
| 5th row | L'Inferno |
Common Values
| Value | Count | Frequency (%) |
| Anna | 10 | < 0.1% |
| Darling | 8 | < 0.1% |
| Wanted | 7 | < 0.1% |
| Vendetta | 7 | < 0.1% |
| Lucky | 7 | < 0.1% |
| I miserabili | 7 | < 0.1% |
| Solo | 7 | < 0.1% |
| Maya | 7 | < 0.1% |
| Aurora | 7 | < 0.1% |
| Alone | 7 | < 0.1% |
| Other values (82084) | 85781 |
Words
| Value | Count | Frequency (%) |
| the | 7894 | 3.1% |
| la | 5049 | 2.0% |
| il | 4206 | 1.6% |
| | 4017 | 1.6% |
| di | 3611 | 1.4% |
| of | 2435 | 1.0% |
| a | 2428 | 0.9% |
| del | 1849 | 0.7% |
| in | 1807 | 0.7% |
| i | 1781 | 0.7% |
| Other values (61622) | 220536 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 169758 | 11.6% |
| a | 131288 | 9.0% |
| e | 128227 | 8.8% |
| i | 100977 | 6.9% |
| o | 94336 | 6.5% |
| n | 81397 | 5.6% |
| r | 75148 | 5.2% |
| t | 63652 | 4.4% |
| l | 63426 | 4.4% |
| s | 54820 | 3.8% |
| Other values (143) | 494224 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1102498 | |
| Space Separator | 169758 | 11.6% |
| Uppercase Letter | 154170 | 10.6% |
| Other Punctuation | 17654 | 1.2% |
| Decimal Number | 6852 | 0.5% |
| Dash Punctuation | 5736 | 0.4% |
| Close Punctuation | 219 | < 0.1% |
| Open Punctuation | 217 | < 0.1% |
| Math Symbol | 55 | < 0.1% |
| Other Letter | 28 | < 0.1% |
| Other values (6) | 66 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 131288 | |
| e | 128227 | |
| i | 100977 | 9.2% |
| o | 94336 | 8.6% |
| n | 81397 | 7.4% |
| r | 75148 | 6.8% |
| t | 63652 | 5.8% |
| l | 63426 | 5.8% |
| s | 54820 | 5.0% |
| d | 40255 | 3.7% |
| Other values (47) | 268972 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13274 | 8.6% |
| L | 12911 | 8.4% |
| T | 12497 | 8.1% |
| M | 9848 | 6.4% |
| A | 9117 | 5.9% |
| B | 9042 | 5.9% |
| I | 8703 | 5.6% |
| D | 8540 | 5.5% |
| C | 8444 | 5.5% |
| P | 7100 | 4.6% |
| Other values (37) | 54694 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 6081 | |
| . | 3927 | |
| : | 3108 | |
| , | 1949 | 11.0% |
| ! | 1290 | 7.3% |
| & | 611 | 3.5% |
| ? | 426 | 2.4% |
| / | 120 | 0.7% |
| # | 36 | 0.2% |
| ¡ | 27 | 0.2% |
| Other values (7) | 79 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1573 | |
| 1 | 1094 | |
| 0 | 1069 | |
| 3 | 852 | |
| 4 | 456 | 6.7% |
| 9 | 428 | 6.2% |
| 7 | 414 | 6.0% |
| 5 | 408 | 6.0% |
| 6 | 289 | 4.2% |
| 8 | 269 | 3.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 39 | |
| = | 10 | 18.2% |
| ~ | 4 | 7.3% |
| × | 2 | 3.6% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ² | 4 | |
| ³ | 2 | 13.3% |
| ¼ | 1 | 6.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 212 | |
| ] | 7 | 3.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 210 | |
| [ | 7 | 3.2% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 18 | |
| º | 10 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 12 | |
| £ | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 169758 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5736 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 25 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1256696 | |
| Common | 200557 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 131288 | 10.4% |
| e | 128227 | 10.2% |
| i | 100977 | 8.0% |
| o | 94336 | 7.5% |
| n | 81397 | 6.5% |
| r | 75148 | 6.0% |
| t | 63652 | 5.1% |
| l | 63426 | 5.0% |
| s | 54820 | 4.4% |
| d | 40255 | 3.2% |
| Other values (96) | 423170 |
Common
| Value | Count | Frequency (%) |
| (space) | 169758 | |
| ' | 6081 | 3.0% |
| - | 5736 | 2.9% |
| . | 3927 | 2.0% |
| : | 3108 | 1.5% |
| , | 1949 | 1.0% |
| 2 | 1573 | 0.8% |
| ! | 1290 | 0.6% |
| 1 | 1094 | 0.5% |
| 0 | 1069 | 0.5% |
| Other values (37) | 4972 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1447863 | |
| None | 9390 | 0.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 169758 | 11.7% |
| a | 131288 | 9.1% |
| e | 128227 | 8.9% |
| i | 100977 | 7.0% |
| o | 94336 | 6.5% |
| n | 81397 | 5.6% |
| r | 75148 | 5.2% |
| t | 63652 | 4.4% |
| l | 63426 | 4.4% |
| s | 54820 | 3.8% |
| Other values (77) | 484834 |
None
| Value | Count | Frequency (%) |
| é | 1055 | 11.2% |
| à | 798 | 8.5% |
| ô | 727 | 7.7% |
| è | 685 | 7.3% |
| ä | 636 | 6.8% |
| á | 619 | 6.6% |
| ü | 558 | 5.9% |
| í | 454 | 4.8% |
| ö | 442 | 4.7% |
| ó | 421 | 4.5% |
| Other values (56) | 2995 |
original_title
Categorical
| Distinct | 80852 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| Anna | 10 |
|---|---|
| Home | 8 |
| The Three Musketeers | 8 |
| Darling | 8 |
| Solo | 8 |
| Other values (80847) |
Length
| Max length | 196 |
|---|---|
| Median length | 92 |
| Mean length | 15.72144895 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1349765 |
|---|---|
| Distinct characters | 155 |
| Distinct categories | 16 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 77123 |
|---|---|
| Unique (%) | 89.8% |
Sample
| 1st row | Miss Jerry |
|---|---|
| 2nd row | The Story of the Kelly Gang |
| 3rd row | Den sorte drøm |
| 4th row | Cleopatra |
| 5th row | L'Inferno |
Common Values
| Value | Count | Frequency (%) |
| Anna | 10 | < 0.1% |
| Home | 8 | < 0.1% |
| The Three Musketeers | 8 | < 0.1% |
| Darling | 8 | < 0.1% |
| Solo | 8 | < 0.1% |
| Inferno | 8 | < 0.1% |
| Wanted | 8 | < 0.1% |
| Blackout | 7 | < 0.1% |
| Eden | 7 | < 0.1% |
| Maya | 7 | < 0.1% |
| Other values (80842) | 85776 |
Words
| Value | Count | Frequency (%) |
| the | 13614 | 5.7% |
| of | 4287 | 1.8% |
| a | 2439 | 1.0% |
| la | 2198 | 0.9% |
| de | 1844 | 0.8% |
| in | 1749 | 0.7% |
| | 1418 | 0.6% |
| no | 1318 | 0.6% |
| and | 1302 | 0.5% |
| to | 1300 | 0.5% |
| Other values (62416) | 207330 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 152944 | 11.3% |
| e | 125860 | 9.3% |
| a | 110153 | 8.2% |
| i | 80503 | 6.0% |
| o | 79650 | 5.9% |
| n | 77197 | 5.7% |
| r | 68591 | 5.1% |
| t | 58303 | 4.3% |
| s | 53884 | 4.0% |
| l | 47826 | 3.5% |
| Other values (145) | 494854 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 998494 | |
| Uppercase Letter | 172343 | 12.8% |
| Space Separator | 152944 | 11.3% |
| Other Punctuation | 15560 | 1.2% |
| Decimal Number | 6391 | 0.5% |
| Dash Punctuation | 3543 | 0.3% |
| Close Punctuation | 190 | < 0.1% |
| Open Punctuation | 188 | < 0.1% |
| Math Symbol | 56 | < 0.1% |
| Currency Symbol | 21 | < 0.1% |
| Other values (6) | 35 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 125860 | |
| a | 110153 | |
| i | 80503 | 8.1% |
| o | 79650 | 8.0% |
| n | 77197 | 7.7% |
| r | 68591 | 6.9% |
| t | 58303 | 5.8% |
| s | 53884 | 5.4% |
| l | 47826 | 4.8% |
| h | 41552 | 4.2% |
| Other values (47) | 254975 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 17828 | 10.3% |
| S | 15405 | 8.9% |
| M | 11837 | 6.9% |
| B | 10896 | 6.3% |
| L | 10420 | 6.0% |
| D | 10239 | 5.9% |
| A | 9828 | 5.7% |
| C | 9395 | 5.5% |
| P | 7673 | 4.5% |
| H | 7650 | 4.4% |
| Other values (37) | 61172 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4113 | |
| : | 3563 | |
| . | 3409 | |
| , | 1901 | |
| ! | 1248 | 8.0% |
| & | 635 | 4.1% |
| ? | 397 | 2.6% |
| / | 127 | 0.8% |
| # | 36 | 0.2% |
| ¡ | 34 | 0.2% |
| Other values (8) | 97 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1471 | |
| 1 | 1075 | |
| 0 | 959 | |
| 3 | 829 | |
| 4 | 398 | 6.2% |
| 9 | 397 | 6.2% |
| 5 | 365 | 5.7% |
| 7 | 359 | 5.6% |
| 6 | 275 | 4.3% |
| 8 | 263 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 39 | |
| = | 9 | 16.1% |
| ~ | 6 | 10.7% |
| × | 2 | 3.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 18 | |
| ¢ | 2 | 9.5% |
| £ | 1 | 4.8% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ² | 3 | 23.1% |
| ³ | 2 | 15.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 180 | |
| ] | 10 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 178 | |
| [ | 10 | 5.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 10 | |
| ® | 1 | 9.1% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 3 | |
| º | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 152944 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3543 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1170841 | |
| Common | 178924 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 125860 | 10.7% |
| a | 110153 | 9.4% |
| i | 80503 | 6.9% |
| o | 79650 | 6.8% |
| n | 77197 | 6.6% |
| r | 68591 | 5.9% |
| t | 58303 | 5.0% |
| s | 53884 | 4.6% |
| l | 47826 | 4.1% |
| h | 41552 | 3.5% |
| Other values (96) | 427322 |
Common
| Value | Count | Frequency (%) |
| (space) | 152944 | |
| ' | 4113 | 2.3% |
| : | 3563 | 2.0% |
| - | 3543 | 2.0% |
| . | 3409 | 1.9% |
| , | 1901 | 1.1% |
| 2 | 1471 | 0.8% |
| ! | 1248 | 0.7% |
| 1 | 1075 | 0.6% |
| 0 | 959 | 0.5% |
| Other values (39) | 4698 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1339467 | |
| None | 10298 | 0.8% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 152944 | 11.4% |
| e | 125860 | 9.4% |
| a | 110153 | 8.2% |
| i | 80503 | 6.0% |
| o | 79650 | 5.9% |
| n | 77197 | 5.8% |
| r | 68591 | 5.1% |
| t | 58303 | 4.4% |
| s | 53884 | 4.0% |
| l | 47826 | 3.6% |
| Other values (77) | 484556 |
None
| Value | Count | Frequency (%) |
| é | 1535 | |
| ô | 1106 | 10.7% |
| ä | 789 | 7.7% |
| á | 728 | 7.1% |
| ü | 670 | 6.5% |
| í | 535 | 5.2% |
| ö | 530 | 5.1% |
| ó | 499 | 4.8% |
| è | 455 | 4.4% |
| û | 328 | 3.2% |
| Other values (58) | 3123 |
year
Categorical
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| 2017 | 3329 |
|---|---|
| 2018 | 3257 |
| 2016 | 3138 |
| 2015 | 2977 |
| 2014 | 2942 |
| Other values (108) |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 4.000104828 |
| Min length | 4 |
Characters and Unicode
| Total characters | 343429 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1894 |
|---|---|
| 2nd row | 1906 |
| 3rd row | 1911 |
| 4th row | 1912 |
| 5th row | 1911 |
Common Values
| Value | Count | Frequency (%) |
| 2017 | 3329 | 3.9% |
| 2018 | 3257 | 3.8% |
| 2016 | 3138 | 3.7% |
| 2015 | 2977 | 3.5% |
| 2014 | 2942 | 3.4% |
| 2019 | 2841 | 3.3% |
| 2013 | 2783 | 3.2% |
| 2012 | 2560 | 3.0% |
| 2011 | 2429 | 2.8% |
| 2009 | 2298 | 2.7% |
| Other values (103) | 57301 |
Words
| Value | Count | Frequency (%) |
| 2017 | 3329 | 3.9% |
| 2018 | 3257 | 3.8% |
| 2016 | 3138 | 3.7% |
| 2015 | 2977 | 3.5% |
| 2014 | 2942 | 3.4% |
| 2019 | 2842 | 3.3% |
| 2013 | 2783 | 3.2% |
| 2012 | 2560 | 3.0% |
| 2011 | 2429 | 2.8% |
| 2009 | 2298 | 2.7% |
| Other values (104) | 57302 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 74835 | |
| 0 | 72686 | |
| 9 | 58164 | |
| 2 | 56117 | |
| 8 | 17090 | 5.0% |
| 7 | 15889 | 4.6% |
| 6 | 14146 | 4.1% |
| 5 | 12623 | 3.7% |
| 4 | 11258 | 3.3% |
| 3 | 10612 | 3.1% |
| Other values (8) | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 343420 | |
| Lowercase Letter | 4 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
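The handful of letters and spaces in an otherwise numeric column suggests at least one malformed entry. Going by the character tables and the maximum length of 13, a value such as "TV Movie 2019" would fit, but that string is a reconstruction and not something the report states. A minimal sketch of a tolerant parser under that assumption, where `extract_year` is a hypothetical helper name:

```python
import re

# Matches the first standalone four-digit number in a string.
YEAR_RE = re.compile(r"\b(\d{4})\b")


def extract_year(raw):
    """Return the first four-digit year in `raw`, or None if there is none."""
    match = YEAR_RE.search(raw)
    return int(match.group(1)) if match else None


print(extract_year("1894"))
# Assumed shape of the malformed entry; not confirmed by the report.
print(extract_year("TV Movie 2019"))
```

Whether to repair such rows or drop them depends on the analysis. The regex only recovers a year when one is present in the text.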
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 74835 | |
| 0 | 72686 | |
| 9 | 58164 | |
| 2 | 56117 | |
| 8 | 17090 | 5.0% |
| 7 | 15889 | 4.6% |
| 6 | 14146 | 4.1% |
| 5 | 12623 | 3.7% |
| 4 | 11258 | 3.3% |
| 3 | 10612 | 3.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| v | 1 | |
| i | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| V | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 343422 | |
| Latin | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 74835 | |
| 0 | 72686 | |
| 9 | 58164 | |
| 2 | 56117 | |
| 8 | 17090 | 5.0% |
| 7 | 15889 | 4.6% |
| 6 | 14146 | 4.1% |
| 5 | 12623 | 3.7% |
| 4 | 11258 | 3.3% |
| 3 | 10612 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| V | 1 | |
| M | 1 | |
| o | 1 | |
| v | 1 | |
| i | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 343429 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 74835 | |
| 0 | 72686 | |
| 9 | 58164 | |
| 2 | 56117 | |
| 8 | 17090 | 5.0% |
| 7 | 15889 | 4.6% |
| 6 | 14146 | 4.1% |
| 5 | 12623 | 3.7% |
| 4 | 11258 | 3.3% |
| 3 | 10612 | 3.1% |
| Other values (8) | 9 | < 0.1% |
date_published
Categorical
| Distinct | 22012 |
|---|---|
| Distinct (%) | 25.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| 2010 | 113 |
|---|---|
| 2008 | 106 |
| 1997 | 100 |
| 1999 | 99 |
| 2009 | 96 |
| Other values (22007) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.681218333 |
| Min length | 4 |
Characters and Unicode
| Total characters | 831181 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 8693 |
|---|---|
| Unique (%) | 10.1% |
Sample
| 1st row | 1894-10-09 |
|---|---|
| 2nd row | 1906-12-26 |
| 3rd row | 1911-08-19 |
| 4th row | 1912-11-13 |
| 5th row | 1911-03-06 |
Common Values
| Value | Count | Frequency (%) |
| 2010 | 113 | 0.1% |
| 2008 | 106 | 0.1% |
| 1997 | 100 | 0.1% |
| 1999 | 99 | 0.1% |
| 2009 | 96 | 0.1% |
| 1985 | 91 | 0.1% |
| 1996 | 90 | 0.1% |
| 1975 | 88 | 0.1% |
| 2011 | 88 | 0.1% |
| 1983 | 87 | 0.1% |
| Other values (22002) | 84897 |
Words
| Value | Count | Frequency (%) |
| 2010 | 113 | 0.1% |
| 2008 | 106 | 0.1% |
| 1997 | 100 | 0.1% |
| 1999 | 99 | 0.1% |
| 2009 | 96 | 0.1% |
| 1985 | 91 | 0.1% |
| 1996 | 90 | 0.1% |
| 1975 | 88 | 0.1% |
| 2011 | 88 | 0.1% |
| 1983 | 87 | 0.1% |
| Other values (22003) | 84899 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 175452 | |
| - | 162584 | |
| 1 | 149685 | |
| 2 | 102736 | |
| 9 | 72450 | |
| 8 | 30994 | 3.7% |
| 3 | 29311 | 3.5% |
| 7 | 28518 | 3.4% |
| 6 | 27158 | 3.3% |
| 5 | 26751 | 3.2% |
| Other values (9) | 25542 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 668588 | |
| Dash Punctuation | 162584 | 19.6% |
| Lowercase Letter | 4 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 175452 | |
| 1 | 149685 | |
| 2 | 102736 | |
| 9 | 72450 | |
| 8 | 30994 | 4.6% |
| 3 | 29311 | 4.4% |
| 7 | 28518 | 4.3% |
| 6 | 27158 | 4.1% |
| 5 | 26751 | 4.0% |
| 4 | 25533 | 3.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| v | 1 | |
| i | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| V | 1 | |
| M | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 162584 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 831174 | |
| Latin | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 175452 | |
| - | 162584 | |
| 1 | 149685 | |
| 2 | 102736 | |
| 9 | 72450 | |
| 8 | 30994 | 3.7% |
| 3 | 29311 | 3.5% |
| 7 | 28518 | 3.4% |
| 6 | 27158 | 3.3% |
| 5 | 26751 | 3.2% |
| Other values (2) | 25535 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| V | 1 | |
| M | 1 | |
| o | 1 | |
| v | 1 | |
| i | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 831181 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 175452 | |
| - | 162584 | |
| 1 | 149685 | |
| 2 | 102736 | |
| 9 | 72450 | |
| 8 | 30994 | 3.7% |
| 3 | 29311 | 3.5% |
| 7 | 28518 | 3.4% |
| 6 | 27158 | 3.3% |
| 5 | 26751 | 3.2% |
| Other values (9) | 25542 | 3.1% |
genre
Categorical
| Distinct | 1257 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 670.9 KiB |
| Drama | 12543 |
|---|---|
| Comedy | 7693 |
| Comedy, Drama | 4039 |
| Drama, Romance | 3455 |
| Comedy, Romance | 2508 |
| Other values (1252) |
Length
| Max length | 31 |
|---|---|
| Median length | 26 |
| Mean length | 14.64983985 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1257762 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 385 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Romance |
|---|---|
| 2nd row | Biography, Crime, Drama |
| 3rd row | Drama |
| 4th row | Drama, History |
| 5th row | Adventure, Drama, Fantasy |
Common Values
| Value | Count | Frequency (%) |
| Drama | 12543 | 14.6% |
| Comedy | 7693 | 9.0% |
| Comedy, Drama | 4039 | 4.7% |
| Drama, Romance | 3455 | 4.0% |
| Comedy, Romance | 2508 | 2.9% |
| Comedy, Drama, Romance | 2293 | 2.7% |
| Horror | 2268 | 2.6% |
| Drama, Thriller | 1348 | 1.6% |
| Crime, Drama | 1343 | 1.6% |
| Action, Crime, Drama | 1310 | 1.5% |
| Other values (1247) | 47055 |
Words
| Value | Count | Frequency (%) |
| drama | 47110 | |
| comedy | 29368 | |
| romance | 14128 | 8.0% |
| action | 12948 | 7.4% |
| thriller | 11388 | 6.5% |
| crime | 11067 | 6.3% |
| horror | 9557 | 5.4% |
| adventure | 7590 | 4.3% |
| mystery | 5225 | 3.0% |
| family | 3962 | 2.3% |
| Other values (15) | 23524 |
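The table above counts individual genre tokens, not full genre strings, which is why "drama" appears far more often than the standalone value "Drama". A rough sketch of that kind of tally, using the five sample rows from this section: splitting on commas and lowercasing is an assumption about the tokenizer, not a description of the profiler's actual code, and `word_counts` is a hypothetical helper name.

```python
from collections import Counter

# The five sample genre strings shown earlier in this section.
sample_genres = [
    "Romance",
    "Biography, Crime, Drama",
    "Drama",
    "Drama, History",
    "Adventure, Drama, Fantasy",
]


def word_counts(values):
    """Split comma-separated labels and count lowercased tokens."""
    counts = Counter()
    for value in values:
        for token in value.split(","):
            token = token.strip().lower()
            if token:
                counts[token] += 1
    return counts


counts = word_counts(sample_genres)
print(counts["drama"], "of", sum(counts.values()), "tokens are 'drama'")
```

For modelling, a multi-label encoding built from these tokens is usually easier to work with than the 1257 distinct combined strings.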
Most occurring characters
| Value | Count | Frequency (%) |
| r | 132666 | 10.5% |
| a | 128740 | 10.2% |
| m | 108441 | 8.6% |
| , | 90012 | 7.2% |
| (space) | 90012 | 7.2% |
| e | 89528 | 7.1% |
| o | 84101 | 6.7% |
| i | 60595 | 4.8% |
| y | 52270 | 4.2% |
| D | 47112 | 3.7% |
| Other values (25) | 374285 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 893320 | |
| Uppercase Letter | 180144 | 14.3% |
| Other Punctuation | 90012 | 7.2% |
| Space Separator | 90012 | 7.2% |
| Dash Punctuation | 4274 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 132666 | |
| a | 128740 | |
| m | 108441 | |
| e | 89528 | |
| o | 84101 | |
| i | 60595 | |
| y | 52270 | 5.9% |
| n | 44345 | 5.0% |
| d | 36960 | 4.1% |
| t | 36666 | 4.1% |
| Other values (9) | 119008 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 47112 | |
| C | 40435 | |
| A | 22681 | |
| R | 14131 | 7.8% |
| F | 12045 | 6.7% |
| H | 11853 | 6.6% |
| T | 11391 | 6.3% |
| M | 8955 | 5.0% |
| S | 4672 | 2.6% |
| W | 3825 | 2.1% |
| Other values (3) | 3044 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 90012 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 90012 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1073464 | |
| Common | 184298 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 132666 | |
| a | 128740 | |
| m | 108441 | 10.1% |
| e | 89528 | 8.3% |
| o | 84101 | 7.8% |
| i | 60595 | 5.6% |
| y | 52270 | 4.9% |
| D | 47112 | 4.4% |
| n | 44345 | 4.1% |
| C | 40435 | 3.8% |
| Other values (22) | 285231 |
Common
| Value | Count | Frequency (%) |
| , | 90012 | |
| (space) | 90012 | |
| - | 4274 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1257762 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 132666 | 10.5% |
| a | 128740 | 10.2% |
| m | 108441 | 8.6% |
| , | 90012 | 7.2% |
| (space) | 90012 | 7.2% |
| e | 89528 | 7.1% |
| o | 84101 | 6.7% |
| i | 60595 | 4.8% |
| y | 52270 | 4.2% |
| D | 47112 | 3.7% |
| Other values (25) | 374285 |
duration
Real number (ℝ≥0)
| Distinct | 266 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.3514181 |
| Minimum | 41 |
|---|---|
| Maximum | 808 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 41 |
|---|---|
| 5-th percentile | 73 |
| Q1 | 88 |
| median | 96 |
| Q3 | 108 |
| 95-th percentile | 142 |
| Maximum | 808 |
| Range | 767 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 22.55384799 |
|---|---|
| Coefficient of variation (CV) | 0.2247486724 |
| Kurtosis | 40.30157626 |
| Mean | 100.3514181 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 3.079705205 |
| Sum | 8615671 |
| Variance | 508.6760589 |
| Monotonicity | Not monotonic |
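Several of the figures above are derived from others in the same tables, so they can be cross-checked with basic arithmetic. The sketch below uses only values printed in this report and small tolerances to allow for rounding in the displayed numbers.

```python
# Values copied from the quantile and descriptive statistics tables.
minimum, q1, q3, maximum = 41, 88, 108, 808
mean = 100.3514181
std = 22.55384799
total = 8615671
n_observations = 85855

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
mean_from_sum = total / n_observations

print(value_range, iqr)
print(f"CV={cv:.6f} variance={variance:.4f} mean={mean_from_sum:.4f}")
```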
| Value | Count | Frequency (%) |
| 90 | 5162 | 6.0% |
| 95 | 3194 | 3.7% |
| 100 | 3106 | 3.6% |
| 92 | 2418 | 2.8% |
| 93 | 2414 | 2.8% |
| 85 | 2308 | 2.7% |
| 88 | 2228 | 2.6% |
| 94 | 2193 | 2.6% |
| 96 | 2177 | 2.5% |
| 91 | 2132 | 2.5% |
| Other values (256) | 58523 |
| Value | Count | Frequency (%) |
| 41 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 62 | |
| 46 | 26 | < 0.1% |
| 47 | 26 | < 0.1% |
| 48 | 36 | |
| 49 | 19 | < 0.1% |
| 50 | 72 |
| Value | Count | Frequency (%) |
| 808 | 1 | < 0.1% |
| 729 | 1 | < 0.1% |
| 580 | 1 | < 0.1% |
| 570 | 1 | < 0.1% |
| 540 | 3 | |
| 485 | 1 | < 0.1% |
| 450 | 1 | < 0.1% |
| 442 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 421 | 1 | < 0.1% |
country
Categorical
| Distinct | 4907 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 64 |
| Missing (%) | 0.1% |
| Memory size | 670.9 KiB |
| USA | 28511 |
|---|---|
| India | 6065 |
| UK | 4111 |
| Japan | 3077 |
| France | 3055 |
| Other values (4902) |
Length
| Max length | 225 |
|---|---|
| Median length | 110 |
| Mean length | 7.24057302 |
| Min length | 2 |
Characters and Unicode
| Total characters | 621176 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 3614 |
|---|---|
| Unique (%) | 4.2% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | Australia |
| 3rd row | Germany, Denmark |
| 4th row | USA |
| 5th row | Italy |
Common Values
| Value | Count | Frequency (%) |
| USA | 28511 | |
| India | 6065 | 7.1% |
| UK | 4111 | 4.8% |
| Japan | 3077 | 3.6% |
| France | 3055 | 3.6% |
| Italy | 2444 | 2.8% |
| Canada | 1802 | 2.1% |
| Germany | 1396 | 1.6% |
| Turkey | 1351 | 1.6% |
| Hong Kong | 1239 | 1.4% |
| Other values (4897) | 32740 |
Words
| Value | Count | Frequency (%) |
| usa | 34325 | |
| france | 8311 | 7.2% |
| uk | 7490 | 6.4% |
| india | 6373 | 5.5% |
| italy | 5056 | 4.4% |
| germany | 4909 | 4.2% |
| japan | 3701 | 3.2% |
| canada | 3621 | 3.1% |
| spain | 2731 | 2.4% |
| hong | 1884 | 1.6% |
| Other values (211) | 37739 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 72788 | 11.7% |
| n | 50990 | 8.2% |
| U | 42996 | 6.9% |
| S | 42087 | 6.8% |
| A | 37428 | 6.0% |
| e | 36290 | 5.8% |
| (space) | 30349 | 4.9% |
| r | 29118 | 4.7% |
| i | 28551 | 4.6% |
| , | 22991 | 3.7% |
| Other values (48) | 227588 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 375790 | |
| Uppercase Letter | 192037 | |
| Space Separator | 30349 | 4.9% |
| Other Punctuation | 22997 | 3.7% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 72788 | |
| n | 50990 | |
| e | 36290 | |
| r | 29118 | 7.7% |
| i | 28551 | 7.6% |
| l | 17254 | 4.6% |
| d | 16614 | 4.4% |
| o | 16082 | 4.3% |
| t | 15164 | 4.0% |
| y | 13552 | 3.6% |
| Other values (17) | 79387 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 42996 | |
| S | 42087 | |
| A | 37428 | |
| I | 13509 | 7.0% |
| K | 10777 | 5.6% |
| F | 9079 | 4.7% |
| C | 6333 | 3.3% |
| G | 5771 | 3.0% |
| J | 3742 | 1.9% |
| B | 2891 | 1.5% |
| Other values (15) | 17424 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22991 | |
| ' | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 30349 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 567827 | |
| Common | 53349 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 72788 | 12.8% |
| n | 50990 | 9.0% |
| U | 42996 | 7.6% |
| S | 42087 | 7.4% |
| A | 37428 | 6.6% |
| e | 36290 | 6.4% |
| r | 29118 | 5.1% |
| i | 28551 | 5.0% |
| l | 17254 | 3.0% |
| d | 16614 | 2.9% |
| Other values (42) | 193711 |
Common
| Value | Count | Frequency (%) |
| (space) | 30349 | |
| , | 22991 | |
| ' | 6 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 621170 | > 99.9% |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 72788 | 11.7% |
| n | 50990 | 8.2% |
| U | 42996 | 6.9% |
| S | 42087 | 6.8% |
| A | 37428 | 6.0% |
| e | 36290 | 5.8% |
| (space) | 30349 | 4.9% |
| r | 29118 | 4.7% |
| i | 28551 | 4.6% |
| , | 22991 | 3.7% |
| Other values (47) | 227582 |
None
| Value | Count | Frequency (%) |
| ô | 6 |
| Distinct | 4377 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 833 |
| Missing (%) | 1.0% |
| Memory size | 670.9 KiB |
| English | 35939 |
|---|---|
| French | 3903 |
| Spanish | 2831 |
| Japanese | 2826 |
| Italian | 2731 |
| Other values (4372) | 36792 |
Length
| Max length | 163 |
|---|---|
| Median length | 7 |
| Mean length | 9.476206158 |
| Min length | 3 |
Characters and Unicode
| Total characters | 805686 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3175 |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | English |
| 4th row | Italian |
| 5th row | English |
Common Values
| Value | Count | Frequency (%) |
| English | 35939 | 41.9% |
| French | 3903 | 4.5% |
| Spanish | 2831 | 3.3% |
| Japanese | 2826 | 3.3% |
| Italian | 2731 | 3.2% |
| Hindi | 2106 | 2.5% |
| German | 1761 | 2.1% |
| Turkish | 1355 | 1.6% |
| Russian | 1345 | 1.6% |
| English, Spanish | 1108 | 1.3% |
| Other values (4367) | 29117 | 33.9% |
Words
| Value | Count | Frequency (%) |
| english | 47453 | |
| french | 8164 | 7.4% |
| spanish | 5685 | 5.2% |
| italian | 4677 | 4.3% |
| german | 4606 | 4.2% |
| japanese | 3888 | 3.5% |
| hindi | 2949 | 2.7% |
| russian | 2816 | 2.6% |
| mandarin | 1946 | 1.8% |
| turkish | 1666 | 1.5% |
| Other values (258) | 25947 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 101920 | |
| i | 86785 | |
| s | 73457 | 9.1% |
| h | 69711 | 8.7% |
| l | 59572 | 7.4% |
| a | 57434 | 7.1% |
| g | 52934 | 6.6% |
| E | 47597 | 5.9% |
| e | 38510 | 4.8% |
| r | 26491 | 3.3% |
| Other values (51) | 191275 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 646212 | 80.2% |
| Uppercase Letter | 110089 | 13.7% |
| Space Separator | 24775 | 3.1% |
| Other Punctuation | 24199 | 3.0% |
| Dash Punctuation | 333 | < 0.1% |
| Decimal Number | 40 | < 0.1% |
| Open Punctuation | 19 | < 0.1% |
| Close Punctuation | 19 | < 0.1% |
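The category names in the table above correspond to Unicode general categories. As a minimal sketch of how such a tally could be reproduced, assuming the standard-library `unicodedata` module and a hypothetical helper `category_counts` (not part of the profiling tool), the following counts characters per category for three values taken from the language column:

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this table.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Three values that appear in the language column of this report.
sample = ["English", "Italian", "English, Spanish"]
for name, count in category_counts(sample).most_common():
    print(name, count)
```

Script and block tallies are not covered by this sketch, because `unicodedata` does not expose those properties.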
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 101920 | |
| i | 86785 | |
| s | 73457 | |
| h | 69711 | |
| l | 59572 | |
| a | 57434 | |
| g | 52934 | |
| e | 38510 | 6.0% |
| r | 26491 | 4.1% |
| u | 11995 | 1.9% |
| Other values (16) | 67403 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 47597 | |
| F | 9134 | 8.3% |
| S | 8151 | 7.4% |
| G | 5680 | 5.2% |
| I | 5199 | 4.7% |
| T | 4491 | 4.1% |
| H | 4102 | 3.7% |
| J | 3888 | 3.5% |
| M | 3414 | 3.1% |
| P | 3353 | 3.0% |
| Other values (16) | 15080 | 13.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10 | |
| 4 | 10 | |
| 5 | 10 | |
| 3 | 10 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 24775 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 24199 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 333 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 19 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 756301 | 93.9% |
| Common | 49385 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 101920 | |
| i | 86785 | |
| s | 73457 | |
| h | 69711 | |
| l | 59572 | |
| a | 57434 | |
| g | 52934 | 7.0% |
| E | 47597 | 6.3% |
| e | 38510 | 5.1% |
| r | 26491 | 3.5% |
| Other values (42) | 141890 |
Common
| Value | Count | Frequency (%) |
| (space) | 24775 | |
| , | 24199 | |
| - | 333 | 0.7% |
| ( | 19 | < 0.1% |
| ) | 19 | < 0.1% |
| 1 | 10 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 10 | < 0.1% |
| 3 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 805686 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 101920 | |
| i | 86785 | |
| s | 73457 | 9.1% |
| h | 69711 | 8.7% |
| l | 59572 | 7.4% |
| a | 57434 | 7.1% |
| g | 52934 | 6.6% |
| E | 47597 | 5.9% |
| e | 38510 | 4.8% |
| r | 26491 | 3.3% |
| Other values (51) | 191275 |
| Distinct | 34733 |
|---|---|
| Distinct (%) | 40.5% |
| Missing | 87 |
| Missing (%) | 0.1% |
| Memory size | 670.9 KiB |
| Jesús Franco | 87 |
|---|---|
| Michael Curtiz | 85 |
| Lesley Selander | 78 |
| Lloyd Bacon | 73 |
| William Beaudine | 70 |
| Other values (34728) | 85375 |
Length
| Max length | 62 |
|---|---|
| Median length | 52 |
| Mean length | 14.65699328 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1257101 |
|---|---|
| Distinct characters | 105 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 21465 |
|---|---|
| Unique (%) | 25.0% |
Sample
| 1st row | Alexander Black |
|---|---|
| 2nd row | Charles Tait |
| 3rd row | Urban Gad |
| 4th row | Charles L. Gaskill |
| 5th row | Francesco Bertolini, Adolfo Padovan |
Common Values
| Value | Count | Frequency (%) |
| Jesús Franco | 87 | 0.1% |
| Michael Curtiz | 85 | 0.1% |
| Lesley Selander | 78 | 0.1% |
| Lloyd Bacon | 73 | 0.1% |
| William Beaudine | 70 | 0.1% |
| Richard Thorpe | 68 | 0.1% |
| John Ford | 67 | 0.1% |
| Gordon Douglas | 64 | 0.1% |
| Raoul Walsh | 61 | 0.1% |
| Mervyn LeRoy | 59 | 0.1% |
| Other values (34723) | 85056 | |
| (Missing) | 87 | 0.1% |
Words
| Value | Count | Frequency (%) |
| john | 1672 | 0.9% |
| david | 1319 | 0.7% |
| michael | 1316 | 0.7% |
| robert | 1196 | 0.6% |
| william | 913 | 0.5% |
| richard | 847 | 0.4% |
| peter | 741 | 0.4% |
| de | 736 | 0.4% |
| james | 721 | 0.4% |
| paul | 703 | 0.4% |
| Other values (31305) | 182945 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 118249 | 9.4% |
| (space) | 107341 | 8.5% |
| e | 101071 | 8.0% |
| i | 83255 | 6.6% |
| n | 82271 | 6.5% |
| r | 81904 | 6.5% |
| o | 71171 | 5.7% |
| l | 54291 | 4.3% |
| s | 43948 | 3.5% |
| t | 39334 | 3.1% |
| Other values (95) | 474266 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 936765 | 74.5% |
| Uppercase Letter | 196738 | 15.7% |
| Space Separator | 107341 | 8.5% |
| Other Punctuation | 13124 | 1.0% |
| Dash Punctuation | 3132 | 0.2% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 118249 | |
| e | 101071 | |
| i | 83255 | 8.9% |
| n | 82271 | 8.8% |
| r | 81904 | 8.7% |
| o | 71171 | 7.6% |
| l | 54291 | 5.8% |
| s | 43948 | 4.7% |
| t | 39334 | 4.2% |
| h | 36152 | 3.9% |
| Other values (46) | 225119 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 17202 | 8.7% |
| M | 17033 | 8.7% |
| J | 13224 | 6.7% |
| A | 13007 | 6.6% |
| R | 12412 | 6.3% |
| C | 12100 | 6.2% |
| B | 11788 | 6.0% |
| D | 10144 | 5.2% |
| L | 9622 | 4.9% |
| G | 9376 | 4.8% |
| Other values (32) | 70830 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6689 | |
| , | 5826 | |
| ' | 607 | 4.6% |
| " | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 107341 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3132 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1133503 | 90.2% |
| Common | 123598 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 118249 | 10.4% |
| e | 101071 | 8.9% |
| i | 83255 | 7.3% |
| n | 82271 | 7.3% |
| r | 81904 | 7.2% |
| o | 71171 | 6.3% |
| l | 54291 | 4.8% |
| s | 43948 | 3.9% |
| t | 39334 | 3.5% |
| h | 36152 | 3.2% |
| Other values (88) | 421857 |
Common
| Value | Count | Frequency (%) |
| (space) | 107341 | |
| . | 6689 | 5.4% |
| , | 5826 | 4.7% |
| - | 3132 | 2.5% |
| ' | 607 | 0.5% |
| " | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1248298 | 99.3% |
| None | 8803 | 0.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 118249 | 9.5% |
| (space) | 107341 | 8.6% |
| e | 101071 | 8.1% |
| i | 83255 | 6.7% |
| n | 82271 | 6.6% |
| r | 81904 | 6.6% |
| o | 71171 | 5.7% |
| l | 54291 | 4.3% |
| s | 43948 | 3.5% |
| t | 39334 | 3.2% |
| Other values (49) | 465463 |
None
| Value | Count | Frequency (%) |
| é | 2084 | |
| á | 1156 | |
| ô | 709 | 8.1% |
| í | 648 | 7.4% |
| ó | 587 | 6.7% |
| ö | 554 | 6.3% |
| ü | 499 | 5.7% |
| ç | 300 | 3.4% |
| ä | 212 | 2.4% |
| Ö | 175 | 2.0% |
| Other values (36) | 1879 |
| Distinct | 66859 |
|---|---|
| Distinct (%) | 79.3% |
| Missing | 1572 |
| Missing (%) | 1.8% |
| Memory size | 670.9 KiB |
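The two percentages in the overview table above appear to use different denominators: Missing (%) matches the missing count divided by all 85855 observations, while Distinct (%) matches the distinct count divided by the non-missing observations only. A minimal sketch under that assumption (the function names are hypothetical, not part of the profiling tool) checks the writer figures:

```python
N_ROWS = 85855  # number of observations reported in the dataset statistics


def missing_pct(n_missing: int, n_rows: int = N_ROWS) -> float:
    """Share of missing cells over all rows, in percent."""
    return 100.0 * n_missing / n_rows


def distinct_pct(n_distinct: int, n_missing: int, n_rows: int = N_ROWS) -> float:
    """Share of distinct values over non-missing rows, in percent (assumed)."""
    return 100.0 * n_distinct / (n_rows - n_missing)


# Figures for the writer column from the table above.
print(round(missing_pct(1572), 1))
print(round(distinct_pct(66859, 1572), 1))

# Dividing by all rows instead would not reproduce the reported 79.3%.
print(round(100.0 * 66859 / N_ROWS, 1))
```

For columns with few missing values, such as director, both denominators round to the same percentage, so the difference only shows up where the missing share is larger.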
| Jing Wong | 84 |
|---|---|
| Kuang Ni | 45 |
| Woody Allen | 40 |
| Erdogan Tünas | 35 |
| Leonardo Benvenuti, Piero De Bernardi | 34 |
| Other values (66854) | 84045 |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 24.00411708 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2023139 |
|---|---|
| Distinct characters | 113 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 58034 |
|---|---|
| Unique (%) | 68.9% |
Sample
| 1st row | Alexander Black |
|---|---|
| 2nd row | Charles Tait |
| 3rd row | Urban Gad, Gebhard Schätzler-Perasini |
| 4th row | Victorien Sardou |
| 5th row | Dante Alighieri |
Common Values
| Value | Count | Frequency (%) |
| Jing Wong | 84 | 0.1% |
| Kuang Ni | 45 | 0.1% |
| Woody Allen | 40 | < 0.1% |
| Erdogan Tünas | 35 | < 0.1% |
| Leonardo Benvenuti, Piero De Bernardi | 34 | < 0.1% |
| Carlo Vanzina, Enrico Vanzina | 32 | < 0.1% |
| Cheh Chang, Kuang Ni | 31 | < 0.1% |
| Giannis Dalianidis | 29 | < 0.1% |
| Ingmar Bergman | 27 | < 0.1% |
| Safa Önal | 27 | < 0.1% |
| Other values (66849) | 83899 | |
| (Missing) | 1572 | 1.8% |
Words
| Value | Count | Frequency (%) |
| john | 2418 | 0.8% |
| david | 1853 | 0.6% |
| robert | 1843 | 0.6% |
| michael | 1772 | 0.6% |
| james | 1215 | 0.4% |
| paul | 1135 | 0.4% |
| de | 1111 | 0.4% |
| richard | 1072 | 0.4% |
| william | 1008 | 0.3% |
| peter | 924 | 0.3% |
| Other values (48157) | 280912 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 210980 | 10.4% |
| a | 183777 | 9.1% |
| e | 156493 | 7.7% |
| n | 128306 | 6.3% |
| r | 126376 | 6.2% |
| i | 124811 | 6.2% |
| o | 109484 | 5.4% |
| l | 83330 | 4.1% |
| s | 68336 | 3.4% |
| t | 61306 | 3.0% |
| Other values (103) | 769940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1438844 | 71.1% |
| Uppercase Letter | 301943 | 14.9% |
| Space Separator | 210980 | 10.4% |
| Other Punctuation | 66855 | 3.3% |
| Dash Punctuation | 4503 | 0.2% |
| Decimal Number | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 183777 | |
| e | 156493 | |
| n | 128306 | 8.9% |
| r | 126376 | 8.8% |
| i | 124811 | 8.7% |
| o | 109484 | 7.6% |
| l | 83330 | 5.8% |
| s | 68336 | 4.7% |
| t | 61306 | 4.3% |
| h | 54751 | 3.8% |
| Other values (48) | 341874 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 26301 | 8.7% |
| S | 24660 | 8.2% |
| J | 20870 | 6.9% |
| A | 20564 | 6.8% |
| B | 20027 | 6.6% |
| C | 19493 | 6.5% |
| R | 17550 | 5.8% |
| D | 15814 | 5.2% |
| G | 14909 | 4.9% |
| K | 14486 | 4.8% |
| Other values (32) | 107269 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 5 | 3 | |
| 7 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 9 | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 55238 | |
| . | 10608 | 15.9% |
| ' | 1006 | 1.5% |
| " | 2 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 210980 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4503 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1740787 | 86.0% |
| Common | 282352 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 183777 | 10.6% |
| e | 156493 | 9.0% |
| n | 128306 | 7.4% |
| r | 126376 | 7.3% |
| i | 124811 | 7.2% |
| o | 109484 | 6.3% |
| l | 83330 | 4.8% |
| s | 68336 | 3.9% |
| t | 61306 | 3.5% |
| h | 54751 | 3.1% |
| Other values (90) | 643817 |
Common
| Value | Count | Frequency (%) |
| (space) | 210980 | |
| , | 55238 | 19.6% |
| . | 10608 | 3.8% |
| - | 4503 | 1.6% |
| ' | 1006 | 0.4% |
| 0 | 4 | < 0.1% |
| 5 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| " | 2 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2009960 | 99.3% |
| None | 13179 | 0.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 210980 | 10.5% |
| a | 183777 | 9.1% |
| e | 156493 | 7.8% |
| n | 128306 | 6.4% |
| r | 126376 | 6.3% |
| i | 124811 | 6.2% |
| o | 109484 | 5.4% |
| l | 83330 | 4.1% |
| s | 68336 | 3.4% |
| t | 61306 | 3.1% |
| Other values (55) | 756761 |
None
| Value | Count | Frequency (%) |
| é | 2927 | |
| á | 1798 | |
| ô | 1175 | |
| í | 1019 | 7.7% |
| ó | 876 | 6.6% |
| ü | 795 | 6.0% |
| ö | 730 | 5.5% |
| ç | 448 | 3.4% |
| è | 380 | 2.9% |
| ä | 340 | 2.6% |
| Other values (38) | 2691 |
| Distinct | 32050 |
|---|---|
| Distinct (%) | 39.4% |
| Missing | 4455 |
| Missing (%) | 5.2% |
| Memory size | 670.9 KiB |
| Metro-Goldwyn-Mayer (MGM) | 1284 |
|---|---|
| Warner Bros. | 1153 |
| Columbia Pictures | 914 |
| Paramount Pictures | 903 |
| Twentieth Century Fox | 865 |
| Other values (32045) | 76281 |
Length
| Max length | 101 |
|---|---|
| Median length | 75 |
| Mean length | 18.26003686 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1486367 |
|---|---|
| Distinct characters | 129 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 22126 |
|---|---|
| Unique (%) | 27.2% |
Sample
| 1st row | Alexander Black Photoplays |
|---|---|
| 2nd row | J. and N. Tait |
| 3rd row | Fotorama |
| 4th row | Helen Gardner Picture Players |
| 5th row | Milano Film |
Common Values
| Value | Count | Frequency (%) |
| Metro-Goldwyn-Mayer (MGM) | 1284 | 1.5% |
| Warner Bros. | 1153 | 1.3% |
| Columbia Pictures | 914 | 1.1% |
| Paramount Pictures | 903 | 1.1% |
| Twentieth Century Fox | 865 | 1.0% |
| Universal Pictures | 732 | 0.9% |
| RKO Radio Pictures | 535 | 0.6% |
| Mosfilm | 279 | 0.3% |
| Universal International Pictures (UI) | 272 | 0.3% |
| Canal+ | 231 | 0.3% |
| Other values (32040) | 74232 | |
| (Missing) | 4455 | 5.2% |
Words
| Value | Count | Frequency (%) |
| films | 11429 | 5.6% |
| productions | 11148 | 5.5% |
| pictures | 9826 | 4.9% |
| film | 9384 | 4.6% |
| entertainment | 5027 | 2.5% |
| company | 1990 | 1.0% |
| international | 1676 | 0.8% |
| production | 1431 | 0.7% |
| metro-goldwyn-mayer | 1309 | 0.6% |
| mgm | 1285 | 0.6% |
| Other values (25140) | 148036 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 123215 | 8.3% |
| (space) | 121141 | 8.2% |
| e | 100096 | 6.7% |
| n | 96877 | 6.5% |
| o | 96222 | 6.5% |
| r | 91195 | 6.1% |
| t | 90440 | 6.1% |
| a | 87994 | 5.9% |
| s | 71570 | 4.8% |
| l | 60630 | 4.1% |
| Other values (119) | 546987 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1107193 | 74.5% |
| Uppercase Letter | 226962 | 15.3% |
| Space Separator | 121141 | 8.2% |
| Other Punctuation | 11473 | 0.8% |
| Dash Punctuation | 5272 | 0.4% |
| Open Punctuation | 4738 | 0.3% |
| Close Punctuation | 4737 | 0.3% |
| Decimal Number | 4464 | 0.3% |
| Math Symbol | 365 | < 0.1% |
| Connector Punctuation | 12 | < 0.1% |
| Other values (2) | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 123215 | |
| e | 100096 | |
| n | 96877 | |
| o | 96222 | |
| r | 91195 | 8.2% |
| t | 90440 | 8.2% |
| a | 87994 | 7.9% |
| s | 71570 | 6.5% |
| l | 60630 | 5.5% |
| m | 53032 | 4.8% |
| Other values (45) | 235922 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 32415 | |
| P | 32019 | |
| C | 22473 | 9.9% |
| M | 15789 | 7.0% |
| A | 13965 | 6.2% |
| S | 13249 | 5.8% |
| B | 10674 | 4.7% |
| E | 10339 | 4.6% |
| G | 9267 | 4.1% |
| I | 8393 | 3.7% |
| Other values (28) | 58379 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8575 | |
| & | 1258 | 11.0% |
| ' | 625 | 5.4% |
| / | 495 | 4.3% |
| " | 230 | 2.0% |
| , | 192 | 1.7% |
| ! | 55 | 0.5% |
| : | 17 | 0.1% |
| @ | 8 | 0.1% |
| % | 6 | 0.1% |
| Other values (3) | 12 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 928 | |
| 0 | 775 | |
| 1 | 679 | |
| 3 | 541 | |
| 4 | 490 | |
| 5 | 260 | 5.8% |
| 7 | 248 | 5.6% |
| 9 | 211 | 4.7% |
| 8 | 198 | 4.4% |
| 6 | 134 | 3.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 363 | |
| ~ | 1 | 0.3% |
| = | 1 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4735 | |
| [ | 3 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4734 | |
| ] | 3 | 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ² | 4 | |
| ½ | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 121141 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5272 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1334155 | 89.8% |
| Common | 152212 | 10.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 123215 | 9.2% |
| e | 100096 | 7.5% |
| n | 96877 | 7.3% |
| o | 96222 | 7.2% |
| r | 91195 | 6.8% |
| t | 90440 | 6.8% |
| a | 87994 | 6.6% |
| s | 71570 | 5.4% |
| l | 60630 | 4.5% |
| m | 53032 | 4.0% |
| Other values (83) | 462884 |
Common
| Value | Count | Frequency (%) |
| (space) | 121141 | |
| . | 8575 | 5.6% |
| - | 5272 | 3.5% |
| ( | 4735 | 3.1% |
| ) | 4734 | 3.1% |
| & | 1258 | 0.8% |
| 2 | 928 | 0.6% |
| 0 | 775 | 0.5% |
| 1 | 679 | 0.4% |
| ' | 625 | 0.4% |
| Other values (26) | 3490 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1480947 | 99.6% |
| None | 5420 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 123215 | 8.3% |
| (space) | 121141 | 8.2% |
| e | 100096 | 6.8% |
| n | 96877 | 6.5% |
| o | 96222 | 6.5% |
| r | 91195 | 6.2% |
| t | 90440 | 6.1% |
| a | 87994 | 5.9% |
| s | 71570 | 4.8% |
| l | 60630 | 4.1% |
| Other values (75) | 541567 |
None
| Value | Count | Frequency (%) |
| é | 2063 | |
| á | 708 | 13.1% |
| ó | 576 | 10.6% |
| í | 256 | 4.7% |
| ç | 253 | 4.7% |
| ü | 251 | 4.6% |
| ú | 176 | 3.2% |
| ñ | 147 | 2.7% |
| õ | 142 | 2.6% |
| ö | 134 | 2.5% |
| Other values (34) | 714 | 13.2% |
| Distinct | 85729 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 69 |
| Missing (%) | 0.1% |
| Memory size | 670.9 KiB |
| Nobuyo Ôyama, Noriko Ohara, Michiko Nomura, Kaneta Kimotsuki, Kazuya Tatekabe | 9 |
|---|---|
| Sergey A. | 6 |
| Bill Corbett, Kevin Murphy, Michael J. Nelson | 6 |
| Keiji Fujiwara, Satomi Kôrogi, Miki Narahashi, Akiko Yajima | 4 |
| Trace Beaulieu, Frank Conniff, Joel Hodgson, Mary Jo Pehl, J. Elvis Weinstein | 3 |
| Other values (85724) | 85758 |
Length
| Max length | 415 |
|---|---|
| Median length | 312 |
| Mean length | 205.166892 |
| Min length | 7 |
Characters and Unicode
| Total characters | 17600447 |
|---|---|
| Distinct characters | 131 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 85693 |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | Blanche Bayliss, William Courtenay, Chauncey Depew |
|---|---|
| 2nd row | Elizabeth Tait, John Tait, Norman Campbell, Bella Cola, Will Coyne, Sam Crewes, Jack Ennis, John Forde, Vera Linden, Mr. Marshall, Mr. McKenzie, Frank Mills, Ollie Wilson |
| 3rd row | Asta Nielsen, Valdemar Psilander, Gunnar Helsengreen, Emil Albes, Hugo Flink, Mary Hagen |
| 4th row | Helen Gardner, Pearl Sindelar, Miss Fielding, Miss Robson, Helene Costello, Charles Sindelar, Mr. Howard, James R. Waite, Mr. Osborne, Harry Knowles, Mr. Paul, Mr. Brady, Mr. Corker |
| 5th row | Salvatore Papa, Arturo Pirovano, Giuseppe de Liguoro, Pier Delle Vigne, Augusto Milla, Attilio Motta, Emilise Beretta |
Common Values
| Value | Count | Frequency (%) |
| Nobuyo Ôyama, Noriko Ohara, Michiko Nomura, Kaneta Kimotsuki, Kazuya Tatekabe | 9 | < 0.1% |
| Sergey A. | 6 | < 0.1% |
| Bill Corbett, Kevin Murphy, Michael J. Nelson | 6 | < 0.1% |
| Keiji Fujiwara, Satomi Kôrogi, Miki Narahashi, Akiko Yajima | 4 | < 0.1% |
| Trace Beaulieu, Frank Conniff, Joel Hodgson, Mary Jo Pehl, J. Elvis Weinstein | 3 | < 0.1% |
| Richard Pryor | 3 | < 0.1% |
| Ian McKellen, Martin Freeman, Richard Armitage, Ken Stott, Graham McTavish, William Kircher, James Nesbitt, Stephen Hunter, Dean O'Gorman, Aidan Turner, John Callen, Peter Hambleton, Jed Brophy, Mark Hadlow, Adam Brown | 3 | < 0.1% |
| Mike Stoklasa | 3 | < 0.1% |
| Tomoki Hirose, Yûki Hiyori, Rin Ishikawa, Itsuki Sagara, Yukihiro Takiguchi, Hinako Tanaka, James Takeshi Yamada | 2 | < 0.1% |
| H.B. Halicki, Marion Busia, Jerry Daugirda, James McIntyre, George Cole, Ronald Halicki, Markos Kotsikos, Parnelli Jones, Gary Bettenhausen, Jonathan E. Fricke, Hal McClain, J.C. Agajanian, J.C. Agajanian Jr., Christopher J.C. Agajanian, Billy Englehart | 2 | < 0.1% |
| Other values (85719) | 85745 | |
| (Missing) | 69 | 0.1% |
Words
| Value | Count | Frequency (%) |
| john | 13834 | 0.6% |
| michael | 11431 | 0.5% |
| david | 9725 | 0.4% |
| robert | 8917 | 0.4% |
| james | 8225 | 0.3% |
| de | 7155 | 0.3% |
| richard | 6857 | 0.3% |
| paul | 6811 | 0.3% |
| lee | 6235 | 0.3% |
| peter | 6143 | 0.3% |
| Other values (209566) | 2294434 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2293981 | 13.0% |
| a | 1593877 | 9.1% |
| e | 1283545 | 7.3% |
| , | 1069507 | 6.1% |
| n | 1063343 | 6.0% |
| i | 1048544 | 6.0% |
| r | 996016 | 5.7% |
| o | 852330 | 4.8% |
| l | 708858 | 4.0% |
| s | 533575 | 3.0% |
| Other values (121) | 6156871 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11731527 | 66.7% |
| Uppercase Letter | 2427273 | 13.8% |
| Space Separator | 2293981 | 13.0% |
| Other Punctuation | 1111873 | 6.3% |
| Dash Punctuation | 35668 | 0.2% |
| Decimal Number | 122 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
| Other Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1593877 | |
| e | 1283545 | |
| n | 1063343 | 9.1% |
| i | 1048544 | 8.9% |
| r | 996016 | 8.5% |
| o | 852330 | 7.3% |
| l | 708858 | 6.0% |
| s | 533575 | 4.5% |
| t | 502791 | 4.3% |
| h | 425908 | 3.6% |
| Other values (48) | 2722740 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 223504 | 9.2% |
| S | 197996 | 8.2% |
| A | 166141 | 6.8% |
| B | 165554 | 6.8% |
| C | 164832 | 6.8% |
| J | 151829 | 6.3% |
| R | 133317 | 5.5% |
| D | 128135 | 5.3% |
| L | 121460 | 5.0% |
| K | 120119 | 4.9% |
| Other values (39) | 854386 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 28 | |
| 5 | 22 | |
| 1 | 21 | |
| 2 | 13 | |
| 4 | 12 | |
| 6 | 8 | 6.6% |
| 3 | 6 | 4.9% |
| 9 | 5 | 4.1% |
| 7 | 4 | 3.3% |
| 8 | 3 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1069507 | |
| . | 30041 | 2.7% |
| ' | 12254 | 1.1% |
| & | 30 | < 0.1% |
| ! | 16 | < 0.1% |
| " | 12 | < 0.1% |
| : | 10 | < 0.1% |
| * | 2 | < 0.1% |
| @ | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2293981 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35668 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14158801 | 80.4% |
| Common | 3441646 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1593877 | 11.3% |
| e | 1283545 | 9.1% |
| n | 1063343 | 7.5% |
| i | 1048544 | 7.4% |
| r | 996016 | 7.0% |
| o | 852330 | 6.0% |
| l | 708858 | 5.0% |
| s | 533575 | 3.8% |
| t | 502791 | 3.6% |
| h | 425908 | 3.0% |
| Other values (98) | 5150014 |
Common
| Value | Count | Frequency (%) |
| (space) | 2293981 | |
| , | 1069507 | |
| - | 35668 | 1.0% |
| . | 30041 | 0.9% |
| ' | 12254 | 0.4% |
| & | 30 | < 0.1% |
| 0 | 28 | < 0.1% |
| 5 | 22 | < 0.1% |
| 1 | 21 | < 0.1% |
| ! | 16 | < 0.1% |
| Other values (13) | 78 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17492911 | 99.4% |
| None | 107535 | 0.6% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2293981 | 13.1% |
| a | 1593877 | 9.1% |
| e | 1283545 | 7.3% |
| , | 1069507 | 6.1% |
| n | 1063343 | 6.1% |
| i | 1048544 | 6.0% |
| r | 996016 | 5.7% |
| o | 852330 | 4.9% |
| l | 708858 | 4.1% |
| s | 533575 | 3.1% |
| Other values (64) | 6049335 |
None
| Value | Count | Frequency (%) |
| é | 23755 | |
| á | 15596 | |
| í | 9129 | 8.5% |
| ô | 9087 | 8.5% |
| ü | 6859 | 6.4% |
| ó | 6500 | 6.0% |
| ö | 5900 | 5.5% |
| ç | 3944 | 3.7% |
| è | 3089 | 2.9% |
| ä | 2528 | 2.4% |
| Other values (46) | 21148 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
| Distinct | 83611 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 2115 |
| Missing (%) | 2.5% |
| Memory size | 670.9 KiB |
| The story of | 15 |
|---|---|
| (blank) | 6 |
| The true story of | 5 |
| In this sequel to | 5 |
| Based on | 5 |
| Other values (83606) | 83704 |
Length
| Max length | 402 |
|---|---|
| Median length | 336 |
| Mean length | 160.0632314 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13403695 |
|---|---|
| Distinct characters | 165 |
| Distinct categories | 20 |
| Distinct scripts | 3 |
| Distinct blocks | 5 |
Unique
| Unique | 83527 |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | The adventures of a female reporter in the 1890s. |
|---|---|
| 2nd row | True story of notorious Australian outlaw Ned Kelly (1855-80). |
| 3rd row | Two men of high rank are both wooing the beautiful and famous equestrian acrobat Stella. While Stella ignores the jeweler Hirsch, she accepts Count von Waldberg's offer to follow her home, ... |
| 4th row | The fabled queen of Egypt's affair with Roman general Marc Antony is ultimately disastrous for both of them. |
| 5th row | Loosely adapted from Dante's Divine Comedy and inspired by the illustrations of Gustav Doré the original silent film has been restored and has a new score by Tangerine Dream. |
Common Values
| Value | Count | Frequency (%) |
| The story of | 15 | < 0.1% |
| (blank) | 6 | < 0.1% |
| The true story of | 5 | < 0.1% |
| In this sequel to | 5 | < 0.1% |
| Based on | 5 | < 0.1% |
| Emil goes to Berlin to see his grandmother with a large amount of money and is offered sweets by a strange man that make him sleep. He wakes up at his stop with no money. It is up to him and a group of children to save the day. | 4 | < 0.1% |
| Tom Sawyer and his pal Huckleberry Finn have great adventures on the Mississippi River, pretending to be pirates, attending their own funeral and witnessing a murder. | 4 | < 0.1% |
| Desperate measures are taken by a man who tries to save his family from the dark side of the law, after they commit an unexpected crime. | 4 | < 0.1% |
| During World War II, a teenage Jewish girl named Anne Frank and her family are forced into hiding in the Nazi-occupied Netherlands. | 4 | < 0.1% |
| After she loses her mobile phone, a lawyer receives a call from the person who found it. They talk and hit it off very quickly. But she's in shock when she sees that he's very short. | 3 | < 0.1% |
| Other values (83601) | 83685 | |
| (Missing) | 2115 | 2.5% |
Words
| Value | Count | Frequency (%) |
| a | 130832 | 5.6% |
| the | 111636 | 4.8% |
| to | 71782 | 3.1% |
| of | 65803 | 2.8% |
| and | 61890 | 2.7% |
| in | 51753 | 2.2% |
| his | 38317 | 1.6% |
| is | 35964 | 1.5% |
| with | 23353 | 1.0% |
| her | 22794 | 1.0% |
| Other values (84674) | 1715088 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2245469 | |
| e | 1256194 | 9.4% |
| a | 893557 | 6.7% |
| t | 845626 | 6.3% |
| i | 800526 | 6.0% |
| o | 777423 | 5.8% |
| n | 767479 | 5.7% |
| r | 713250 | 5.3% |
| s | 712075 | 5.3% |
| h | 546308 | 4.1% |
| Other values (155) | 3845788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10396199 | 77.6% |
| Space Separator | 2245476 | 16.8% |
| Uppercase Letter | 342324 | 2.6% |
| Other Punctuation | 336402 | 2.5% |
| Decimal Number | 39039 | 0.3% |
| Dash Punctuation | 30712 | 0.2% |
| Open Punctuation | 6804 | 0.1% |
| Close Punctuation | 6352 | < 0.1% |
| Currency Symbol | 285 | < 0.1% |
| Math Symbol | 29 | < 0.1% |
| Other values (10) | 73 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1256194 | |
| a | 893557 | 8.6% |
| t | 845626 | 8.1% |
| i | 800526 | 7.7% |
| o | 777423 | 7.5% |
| n | 767479 | 7.4% |
| r | 713250 | 6.9% |
| s | 712075 | 6.8% |
| h | 546308 | 5.3% |
| l | 440323 | 4.2% |
| Other values (47) | 2643438 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 55266 | |
| T | 32942 | 9.6% |
| S | 26411 | 7.7% |
| M | 18411 | 5.4% |
| C | 18018 | 5.3% |
| B | 17744 | 5.2% |
| I | 17021 | 5.0% |
| H | 16227 | 4.7% |
| W | 15802 | 4.6% |
| D | 11908 | 3.5% |
| Other values (35) | 112574 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 185255 | |
| , | 108716 | |
| ' | 26768 | 8.0% |
| " | 8711 | 2.6% |
| : | 2185 | 0.6% |
| ? | 1882 | 0.6% |
| ; | 1266 | 0.4% |
| / | 678 | 0.2% |
| ! | 542 | 0.2% |
| & | 303 | 0.1% |
| Other values (9) | 96 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9488 | |
| 0 | 6927 | |
| 9 | 5749 | |
| 2 | 3765 | 9.6% |
| 8 | 2400 | 6.1% |
| 3 | 2333 | 6.0% |
| 5 | 2305 | 5.9% |
| 4 | 2241 | 5.7% |
| 7 | 1932 | 4.9% |
| 6 | 1899 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 16 | |
| ~ | 7 | |
| = | 4 | 13.8% |
| ¬ | 1 | 3.4% |
| ± | 1 | 3.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 8 | |
| ^ | 2 | 13.3% |
| ¸ | 2 | 13.3% |
| ¨ | 2 | 13.3% |
| ´ | 1 | 6.7% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 6 | |
| ° | 4 | |
| ¦ | 1 | 7.7% |
| ® | 1 | 7.7% |
| � | 1 | 7.7% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 270 | |
| £ | 13 | 4.6% |
| ¢ | 2 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2245469 | |
| (non-ASCII space) | 7 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6784 | |
| [ | 20 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6334 | |
| ] | 18 | 0.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 13 | |
| ’ | 2 | 13.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30712 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 16 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Format
| Value | Count | Frequency (%) |
| (invisible format character) | 2 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Control
| Value | Count | Frequency (%) |
| (control character) | 1 |
Other Number
| Value | Count | Frequency (%) |
| ³ | 1 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ۪ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10738524 | 80.1% |
| Common | 2665170 | 19.9% |
| Arabic | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1256194 | |
| a | 893557 | 8.3% |
| t | 845626 | 7.9% |
| i | 800526 | 7.5% |
| o | 777423 | 7.2% |
| n | 767479 | 7.1% |
| r | 713250 | 6.6% |
| s | 712075 | 6.6% |
| h | 546308 | 5.1% |
| l | 440323 | 4.1% |
| Other values (93) | 2985763 |
Common
| Value | Count | Frequency (%) |
| (space) | 2245469 | |
| . | 185255 | 7.0% |
| , | 108716 | 4.1% |
| - | 30712 | 1.2% |
| ' | 26768 | 1.0% |
| 1 | 9488 | 0.4% |
| " | 8711 | 0.3% |
| 0 | 6927 | 0.3% |
| ( | 6784 | 0.3% |
| ) | 6334 | 0.2% |
| Other values (51) | 30006 | 1.1% |
Arabic
| Value | Count | Frequency (%) |
| ۪ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13400163 | |
| None | 3528 | < 0.1% |
| Punctuation | 2 | < 0.1% |
| Arabic | 1 | < 0.1% |
| Specials | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2245469 | ||
| e | 1256194 | 9.4% |
| a | 893557 | 6.7% |
| t | 845626 | 6.3% |
| i | 800526 | 6.0% |
| o | 777423 | 5.8% |
| n | 767479 | 5.7% |
| r | 713250 | 5.3% |
| s | 712075 | 5.3% |
| h | 546308 | 4.1% |
| Other values (80) | 3842256 |
None
| Value | Count | Frequency (%) |
| é | 1368 | |
| á | 317 | 9.0% |
| í | 206 | 5.8% |
| ü | 165 | 4.7% |
| ö | 149 | 4.2% |
| ó | 142 | 4.0% |
| è | 140 | 4.0% |
| ç | 109 | 3.1% |
| ä | 103 | 2.9% |
| ã | 96 | 2.7% |
| Other values (62) | 733 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 2 |
Arabic
| Value | Count | Frequency (%) |
| ۪ | 1 |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
| Distinct | 89 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.898655873 |
| Minimum | 1 |
|---|---|
| Maximum | 9.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3.5 |
| Q1 | 5.2 |
| median | 6.1 |
| Q3 | 6.8 |
| 95-th percentile | 7.6 |
| Maximum | 9.9 |
| Range | 8.9 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.234987351 |
|---|---|
| Coefficient of variation (CV) | 0.2093675878 |
| Kurtosis | 0.5978266621 |
| Mean | 5.898655873 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.7609643007 |
| Sum | 506429.1 |
| Variance | 1.525193758 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.4 | 3407 | 4.0% |
| 6.2 | 3347 | 3.9% |
| 6.5 | 3335 | 3.9% |
| 6.3 | 3318 | 3.9% |
| 6.6 | 3195 | 3.7% |
| 6.1 | 3139 | 3.7% |
| 6.7 | 3085 | 3.6% |
| 6.8 | 3073 | 3.6% |
| 6 | 2832 | 3.3% |
| 7 | 2768 | 3.2% |
| Other values (79) | 54356 |
| Value | Count | Frequency (%) |
| 1 | 16 | < 0.1% |
| 1.1 | 20 | < 0.1% |
| 1.2 | 20 | < 0.1% |
| 1.3 | 25 | < 0.1% |
| 1.4 | 26 | < 0.1% |
| 1.5 | 33 | |
| 1.6 | 49 | |
| 1.7 | 45 | |
| 1.8 | 70 | |
| 1.9 | 68 |
| Value | Count | Frequency (%) |
| 9.9 | 1 | < 0.1% |
| 9.8 | 4 | < 0.1% |
| 9.7 | 3 | < 0.1% |
| 9.5 | 2 | < 0.1% |
| 9.4 | 2 | < 0.1% |
| 9.3 | 7 | < 0.1% |
| 9.2 | 9 | < 0.1% |
| 9.1 | 8 | < 0.1% |
| 9 | 19 | |
| 8.9 | 28 |
| Distinct | 14933 |
|---|---|
| Distinct (%) | 17.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9493.489605 |
| Minimum | 99 |
|---|---|
| Maximum | 2278845 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 99 |
|---|---|
| 5-th percentile | 114 |
| Q1 | 205 |
| median | 484 |
| Q3 | 1766.5 |
| 95-th percentile | 33416.2 |
| Maximum | 2278845 |
| Range | 2278746 |
| Interquartile range (IQR) | 1561.5 |
Descriptive statistics
| Standard deviation | 53574.35954 |
|---|---|
| Coefficient of variation (CV) | 5.64327363 |
| Kurtosis | 325.2774404 |
| Mean | 9493.489605 |
| Median Absolute Deviation (MAD) | 344 |
| Skewness | 14.61947943 |
| Sum | 815063550 |
| Variance | 2870212000 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 102 | 316 | 0.4% |
| 101 | 315 | 0.4% |
| 105 | 309 | 0.4% |
| 100 | 308 | 0.4% |
| 106 | 295 | 0.3% |
| 112 | 292 | 0.3% |
| 111 | 288 | 0.3% |
| 107 | 285 | 0.3% |
| 113 | 285 | 0.3% |
| 110 | 282 | 0.3% |
| Other values (14923) | 82880 |
| Value | Count | Frequency (%) |
| 99 | 5 | < 0.1% |
| 100 | 308 | |
| 101 | 315 | |
| 102 | 316 | |
| 103 | 276 | |
| 104 | 268 | |
| 105 | 309 | |
| 106 | 295 | |
| 107 | 285 | |
| 108 | 275 |
| Value | Count | Frequency (%) |
| 2278845 | 1 | |
| 2241615 | 1 | |
| 2002816 | 1 | |
| 1807440 | 1 | |
| 1780147 | 1 | |
| 1755490 | 1 | |
| 1632315 | 1 | |
| 1619920 | 1 | |
| 1604280 | 1 | |
| 1572674 | 1 |
| Distinct | 2506 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29193851.35 |
| Minimum | 0 |
|---|---|
| Maximum | 3.5 × 10^11 |
| Zeros | 62179 |
| Zeros (%) | 72.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 100000 |
| 95-th percentile | 24000000 |
| Maximum | 3.5 × 10^11 |
| Range | 3.5 × 10^11 |
| Interquartile range (IQR) | 100000 |
Descriptive statistics
| Standard deviation | 1456806506 |
|---|---|
| Coefficient of variation (CV) | 49.90114147 |
| Kurtosis | 39661.9928 |
| Mean | 29193851.35 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 176.1867109 |
| Sum | 2.506438108 × 10^12 |
| Variance | 2.122285197 × 10^18 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 62179 | |
| 1000000 | 1004 | 1.2% |
| 2000000 | 765 | 0.9% |
| 3000000 | 698 | 0.8% |
| 5000000 | 655 | 0.8% |
| 10000000 | 559 | 0.7% |
| 500000 | 525 | 0.6% |
| 1500000 | 498 | 0.6% |
| 4000000 | 472 | 0.5% |
| 20000000 | 454 | 0.5% |
| Other values (2496) | 18046 | 21.0% |
| Value | Count | Frequency (%) |
| 0 | 62179 | |
| 1 | 10 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 2 | < 0.1% |
| 10 | 5 | < 0.1% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 × 10^11 | 1 | < 0.1% |
| 1.2 × 10^11 | 1 | < 0.1% |
| 8 × 10^10 | 1 | < 0.1% |
| 7 × 10^10 | 1 | < 0.1% |
| 6.62168 × 10^10 | 1 | < 0.1% |
| 5.9 × 10^10 | 1 | < 0.1% |
| 5.5 × 10^10 | 1 | < 0.1% |
| 5 × 10^10 | 2 | < 0.1% |
| 3.5 × 10^10 | 3 | |
| 3 × 10^10 | 5 |
| Distinct | 14858 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3479774.818 |
| Minimum | 0 |
|---|---|
| Maximum | 936662225 |
| Zeros | 70529 |
| Zeros (%) | 82.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 13540658.5 |
| Maximum | 936662225 |
| Range | 936662225 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 21706663.98 |
|---|---|
| Coefficient of variation (CV) | 6.237950764 |
| Kurtosis | 276.9293875 |
| Mean | 3479774.818 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.36640586 |
| Sum | 2.98756067 × 10^11 |
| Variance | 4.711792612 × 10^14 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 70529 | |
| 1000000 | 19 | < 0.1% |
| 1500000 | 17 | < 0.1% |
| 8144 | 17 | < 0.1% |
| 509 | 13 | < 0.1% |
| 1400000 | 13 | < 0.1% |
| 2000000 | 12 | < 0.1% |
| 46808 | 11 | < 0.1% |
| 3270000 | 11 | < 0.1% |
| 1300000 | 11 | < 0.1% |
| Other values (14848) | 15202 | 17.7% |
| Value | Count | Frequency (%) |
| 0 | 70529 | |
| 30 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 74 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 147 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 936662225 | 1 | |
| 858373000 | 1 | |
| 760507625 | 1 | |
| 700426566 | 1 | |
| 678815482 | 1 | |
| 659363944 | 1 | |
| 652270625 | 1 | |
| 623357910 | 1 | |
| 620181382 | 1 | |
| 608581744 | 1 |
| Distinct | 30411 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8339442.376 |
| Minimum | 0 |
|---|---|
| Maximum | 2797800564 |
| Zeros | 54839 |
| Zeros (%) | 63.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 211466 |
| 95-th percentile | 26266775.2 |
| Maximum | 2797800564 |
| Range | 2797800564 |
| Interquartile range (IQR) | 211466 |
Descriptive statistics
| Standard deviation | 55319615.66 |
|---|---|
| Coefficient of variation (CV) | 6.633490966 |
| Kurtosis | 408.2062341 |
| Mean | 8339442.376 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.02837166 |
| Sum | 7.159828252 × 10^11 |
| Variance | 3.060259877 × 10^15 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 54839 | |
| 8144 | 15 | < 0.1% |
| 46808 | 10 | < 0.1% |
| 509 | 9 | < 0.1% |
| 97182 | 6 | < 0.1% |
| 14000000 | 5 | < 0.1% |
| 2874 | 5 | < 0.1% |
| 11000000 | 4 | < 0.1% |
| 1500000 | 4 | < 0.1% |
| 220000000 | 4 | < 0.1% |
| Other values (30401) | 30954 |
| Value | Count | Frequency (%) |
| 0 | 54839 | |
| 1 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2797800564 | 1 | |
| 2790439092 | 1 | |
| 2195169869 | 1 | |
| 2068224036 | 1 | |
| 2048359754 | 1 | |
| 1670401444 | 1 | |
| 1656963790 | 1 | |
| 1518814206 | 1 | |
| 1515048151 | 1 | |
| 1450026933 | 1 |
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 72550 |
| Missing (%) | 84.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.89688087 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 43 |
| median | 57 |
| Q3 | 69 |
| 95-th percentile | 84 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 17.78487427 |
|---|---|
| Coefficient of variation (CV) | 0.3181729284 |
| Kurtosis | -0.4316876445 |
| Mean | 55.89688087 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.1614852504 |
| Sum | 743708 |
| Variance | 316.3017529 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 64 | 303 | 0.4% |
| 55 | 296 | 0.3% |
| 65 | 291 | 0.3% |
| 57 | 291 | 0.3% |
| 61 | 284 | 0.3% |
| 62 | 281 | 0.3% |
| 49 | 278 | 0.3% |
| 66 | 275 | 0.3% |
| 68 | 273 | 0.3% |
| 58 | 273 | 0.3% |
| Other values (89) | 10460 | 12.2% |
| (Missing) | 72550 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 8 | |
| 8 | 8 | |
| 9 | 15 | |
| 10 | 12 | |
| 11 | 16 |
| Value | Count | Frequency (%) |
| 100 | 16 | < 0.1% |
| 99 | 8 | < 0.1% |
| 98 | 9 | < 0.1% |
| 97 | 14 | < 0.1% |
| 96 | 27 | |
| 95 | 15 | < 0.1% |
| 94 | 27 | |
| 93 | 27 | |
| 92 | 25 | |
| 91 | 42 |
| Distinct | 1213 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 7597 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.0408265 |
| Minimum | 1 |
|---|---|
| Maximum | 10472 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 9 |
| Q3 | 27 |
| 95-th percentile | 186 |
| Maximum | 10472 |
| Range | 10471 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 178.5114112 |
|---|---|
| Coefficient of variation (CV) | 3.877241673 |
| Kurtosis | 581.6803158 |
| Mean | 46.0408265 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 17.71999159 |
| Sum | 3603063 |
| Variance | 31866.32391 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 7546 | 8.8% |
| 2 | 6559 | 7.6% |
| 3 | 5373 | 6.3% |
| 4 | 4581 | 5.3% |
| 5 | 3929 | 4.6% |
| 6 | 3457 | 4.0% |
| 7 | 3045 | 3.5% |
| 8 | 2665 | 3.1% |
| 9 | 2483 | 2.9% |
| 10 | 2177 | 2.5% |
| Other values (1203) | 36443 | |
| (Missing) | 7597 | 8.8% |
| Value | Count | Frequency (%) |
| 1 | 7546 | |
| 2 | 6559 | |
| 3 | 5373 | |
| 4 | 4581 | |
| 5 | 3929 | |
| 6 | 3457 | |
| 7 | 3045 | |
| 8 | 2665 | 3.1% |
| 9 | 2483 | 2.9% |
| 10 | 2177 | 2.5% |
| Value | Count | Frequency (%) |
| 10472 | 1 | |
| 8869 | 1 | |
| 8232 | 1 | |
| 7639 | 1 | |
| 7553 | 1 | |
| 7207 | 1 | |
| 6938 | 1 | |
| 6718 | 1 | |
| 5392 | 1 | |
| 5261 | 1 |
| Distinct | 595 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 11797 |
| Missing (%) | 13.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.47998866 |
| Minimum | 1 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 670.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 23 |
| 95-th percentile | 126 |
| Maximum | 999 |
| Range | 998 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 58.3391584 |
|---|---|
| Coefficient of variation (CV) | 2.122968795 |
| Kurtosis | 34.73872691 |
| Mean | 27.47998866 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 5.028834999 |
| Sum | 2035113 |
| Variance | 3403.457402 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 8506 | 9.9% |
| 2 | 6822 | 7.9% |
| 3 | 5437 | 6.3% |
| 4 | 4722 | 5.5% |
| 5 | 3884 | 4.5% |
| 6 | 3215 | 3.7% |
| 7 | 2774 | 3.2% |
| 8 | 2451 | 2.9% |
| 9 | 2168 | 2.5% |
| 10 | 1941 | 2.3% |
| Other values (585) | 32138 | |
| (Missing) | 11797 | 13.7% |
| Value | Count | Frequency (%) |
| 1 | 8506 | |
| 2 | 6822 | |
| 3 | 5437 | |
| 4 | 4722 | |
| 5 | 3884 | |
| 6 | 3215 | 3.7% |
| 7 | 2774 | 3.2% |
| 8 | 2451 | 2.9% |
| 9 | 2168 | 2.5% |
| 10 | 1941 | 2.3% |
| Value | Count | Frequency (%) |
| 999 | 1 | |
| 909 | 1 | |
| 838 | 1 | |
| 833 | 1 | |
| 830 | 1 | |
| 813 | 1 | |
| 782 | 1 | |
| 769 | 1 | |
| 755 | 1 | |
| 740 | 1 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
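The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator uses; the helper names `average_ranks`, `pearson_r`, and `spearman_rho`, and the sample data, are assumptions made for this example. Ties are handled by giving tied values the mean of their ranks, which is a common convention.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
rho = spearman_rho(x, y)  # ranks agree exactly, so rho is 1 up to rounding
r = pearson_r(x, y)       # below 1 because the relationship is not linear
```

The common 1/n factors in the covariance and standard deviations cancel, so the sketch works with raw sums of deviations.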
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
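As a rough illustration of the formula and of the invariance property, the following sketch computes r before and after shifting and rescaling both variables. The function name `pearson_r` and the sample values are assumptions made for this example. Note that the invariance holds for positive scale factors; a negative factor on one variable flips the sign of r.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Hypothetical, roughly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]
r_original = pearson_r(x, y)

# Change location and scale of each variable separately (positive scales).
x_transformed = [10 * a + 3 for a in x]
y_transformed = [0.5 * b - 7 for b in y]
r_transformed = pearson_r(x_transformed, y_transformed)

# The two coefficients agree up to floating-point rounding.
print(r_original, r_transformed)
```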
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
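The pair-counting definition above corresponds to the variant usually called τ-a, and can be sketched as follows. The function name `kendall_tau_a` and the sample data are assumptions made for this example; statistical libraries often default to a tie-corrected variant (τ-b), which can give different values when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs; tied pairs count as neither."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical data: 7 concordant and 3 discordant pairs out of 10.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau_a(x, y)  # (7 - 3) / 10 = 0.4
```

This brute-force version compares every pair, so it runs in quadratic time; that is fine for illustration but slow for a dataset with tens of thousands of rows.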
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available in the Phik project documentation.
First rows
| | imdb_title_id | title | original_title | year | date_published | genre | duration | country | language | director | writer | production_company | actors | description | avg_vote | votes | budget | usa_gross_income | worlwide_gross_income | metascore | reviews_from_users | reviews_from_critics |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | tt0000009 | Miss Jerry | Miss Jerry | 1894 | 1894-10-09 | Romance | 45 | USA | None | Alexander Black | Alexander Black | Alexander Black Photoplays | Blanche Bayliss, William Courtenay, Chauncey Depew | The adventures of a female reporter in the 1890s. | 5.9 | 154 | 0 | 0 | 0 | NaN | 1.0 | 2.0 |
| 1 | tt0000574 | The Story of the Kelly Gang | The Story of the Kelly Gang | 1906 | 1906-12-26 | Biography, Crime, Drama | 70 | Australia | None | Charles Tait | Charles Tait | J. and N. Tait | Elizabeth Tait, John Tait, Norman Campbell, Bella Cola, Will Coyne, Sam Crewes, Jack Ennis, John Forde, Vera Linden, Mr. Marshall, Mr. McKenzie, Frank Mills, Ollie Wilson | True story of notorious Australian outlaw Ned Kelly (1855-80). | 6.1 | 589 | 2250 | 0 | 0 | NaN | 7.0 | 7.0 |
| 2 | tt0001892 | Den sorte drøm | Den sorte drøm | 1911 | 1911-08-19 | Drama | 53 | Germany, Denmark | None | Urban Gad | Urban Gad, Gebhard Schätzler-Perasini | Fotorama | Asta Nielsen, Valdemar Psilander, Gunnar Helsengreen, Emil Albes, Hugo Flink, Mary Hagen | Two men of high rank are both wooing the beautiful and famous equestrian acrobat Stella. While Stella ignores the jeweler Hirsch, she accepts Count von Waldberg's offer to follow her home, ... | 5.8 | 188 | 0 | 0 | 0 | NaN | 5.0 | 2.0 |
| 3 | tt0002101 | Cleopatra | Cleopatra | 1912 | 1912-11-13 | Drama, History | 100 | USA | English | Charles L. Gaskill | Victorien Sardou | Helen Gardner Picture Players | Helen Gardner, Pearl Sindelar, Miss Fielding, Miss Robson, Helene Costello, Charles Sindelar, Mr. Howard, James R. Waite, Mr. Osborne, Harry Knowles, Mr. Paul, Mr. Brady, Mr. Corker | The fabled queen of Egypt's affair with Roman general Marc Antony is ultimately disastrous for both of them. | 5.2 | 446 | 45000 | 0 | 0 | NaN | 25.0 | 3.0 |
| 4 | tt0002130 | L'Inferno | L'Inferno | 1911 | 1911-03-06 | Adventure, Drama, Fantasy | 68 | Italy | Italian | Francesco Bertolini, Adolfo Padovan | Dante Alighieri | Milano Film | Salvatore Papa, Arturo Pirovano, Giuseppe de Liguoro, Pier Delle Vigne, Augusto Milla, Attilio Motta, Emilise Beretta | Loosely adapted from Dante's Divine Comedy and inspired by the illustrations of Gustav Doré the original silent film has been restored and has a new score by Tangerine Dream. | 7.0 | 2237 | 0 | 0 | 0 | NaN | 31.0 | 14.0 |
| 5 | tt0002199 | From the Manger to the Cross; or, Jesus of Nazareth | From the Manger to the Cross; or, Jesus of Nazareth | 1912 | 1913 | Biography, Drama | 60 | USA | English | Sidney Olcott | Gene Gauntier | Kalem Company | R. Henderson Bland, Percy Dyer, Gene Gauntier, Alice Hollister, Samuel Morgan, James D. Ainsley, Robert G. Vignola, George Kellog, J.P. McGowan | An account of the life of Jesus Christ, based on the books of the New Testament: After Jesus' birth is foretold to his parents, he is born in Bethlehem, and is visited by shepherds and wise... | 5.7 | 484 | 0 | 0 | 0 | NaN | 13.0 | 5.0 |
| 6 | tt0002423 | Madame DuBarry | Madame DuBarry | 1919 | 1919-11-26 | Biography, Drama, Romance | 85 | Germany | German | Ernst Lubitsch | Norbert Falk, Hanns Kräly | Projektions-AG Union (PAGU) | Pola Negri, Emil Jannings, Harry Liedtke, Eduard von Winterstein, Reinhold Schünzel, Else Berna, Fred Immler, Gustav Czimeg, Karl Platen, Bernhard Goetzke, Magnus Stifter, Paul Biensfeldt, Willy Kaiser-Heyl, Alexander Ekert, Robert Sortsch-Pla | The story of Madame DuBarry, the mistress of Louis XV of France, and her loves in the time of the French revolution. | 6.8 | 753 | 0 | 0 | 0 | NaN | 12.0 | 9.0 |
| 7 | tt0002445 | Quo Vadis? | Quo Vadis? | 1913 | 1913-03-01 | Drama, History | 120 | Italy | Italian | Enrico Guazzoni | Henryk Sienkiewicz, Enrico Guazzoni | Società Italiana Cines | Amleto Novelli, Gustavo Serena, Carlo Cattaneo, Amelia Cattaneo, Lea Giunchi, Bruto Castellani, Augusto Mastripietri, Cesare Moltini, Olga Brandini, Ignazio Lupi, Giovanni Gizzi, Lia Orlandini, Matilde Guillaume, Ida Carloni Talli, Giuseppe Gambardella | An epic Italian film "Quo Vadis" influenced many of the later movies. | 6.2 | 273 | 45000 | 0 | 0 | NaN | 7.0 | 5.0 |
| 8 | tt0002452 | Independenta Romaniei | Independenta Romaniei | 1912 | 1912-09-01 | History, War | 120 | Romania | None | Aristide Demetriade, Grigore Brezeanu | Aristide Demetriade, Petre Liciu | Societatea Filmului de Arta Leon Popescu | Aristide Demetriade, Constanta Demetriade, Constantin Nottara, Pepi Machauer, Aurel Athanasescu, Jeny Metaxa-Doro, Nicolae Soreanu, Vasile Toneanu, Aristita Romanescu, Elvire Popesco, M. Vîrgolici, C. Nedelcovici, Mihail Tancovici-Cosmin, Ion Dumitrescu, Gheorghe Meliseanu | The movie depicts the Romanian War of Independence (1877-1878). | 6.7 | 198 | 400000 | 0 | 0 | NaN | 4.0 | 1.0 |
| 9 | tt0002461 | Richard III | Richard III | 1912 | 1912-10-15 | Drama | 55 | France, USA | English | André Calmettes, James Keane | James Keane, William Shakespeare | Le Film d'Art | Robert Gemp, Frederick Warde, Albert Gardner, James Keane, George Moss, Howard Stuart, Virginia Rankin, Violet Stuart, Carey Lee, Carlotta De Felice | Richard of Gloucester uses manipulation and murder to gain the English throne. | 5.5 | 225 | 30000 | 0 | 0 | NaN | 8.0 | 1.0 |
Last rows
| | imdb_title_id | title | original_title | year | date_published | genre | duration | country | language | director | writer | production_company | actors | description | avg_vote | votes | budget | usa_gross_income | worlwide_gross_income | metascore | reviews_from_users | reviews_from_critics |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 85845 | tt9904250 | La reina de los lagartos | La reina de los lagartos | 2019 | 2019-10-05 | Fantasy | 63 | None | Spanish, Catalan | Juan González, Nando Martínez | Juan González, Nando Martínez | Aquí y Allí Films | Javier Botet, Bruna Cusí, Miki Esparbé, Ivan Labanda | A spaceship is about to come to pick up Javi, so him and Berta have to put an end to their summer love. | 4.8 | 103 | 0 | 0 | 0 | NaN | NaN | 5.0 |
| 85846 | tt9904802 | Enemy Lines | Enemy Lines | 2020 | 2020-05-04 | War | 92 | UK | English, Polish, Russian, German | Anders Banke | Michael Wright, Tom George | Happy Hour Films | Ed Westwick, John Hannah, Tom Wisdom, Corey Johnson, Pawel Delag, Gary Grant, Daniel Jillings, Scott Haining, Ekaterina Vladimirova, Vladimir Epifantsev, Kirill Pletnyov, Patrik Karlson, Andrey Karako, Jean-Marc Birkholz, Aleksandr Zlatopolskiy | In the frozen, war torn landscape of occupied Poland during World War II, a crack team of allied commandos are sent on a deadly mission behind enemy lines to extract a rocket scientist from the hands of the Nazis. | 5.0 | 764 | 0 | 0 | 0 | NaN | 29.0 | 6.0 |
| 85847 | tt9905412 | Ottam | Ottam | 2019 | 2019-03-08 | Drama | 120 | India | Malayalam | Zam | Rajesh k Narayan | Thomas Thiruvalla Films | Nandu Anand, Roshan Ullas, Manikandan R. Achari, Alencier Ley Lopez, Kalabhavan Shajohn, Rohini, Madhuri Dilip, Althaf, Sudheer Karamana, Thezni Khan, Rajesh Sharma | Set in Trivandrum, the story of Ottam unfolds in a day, and progresses through the lives of two youngsters - Abhi and Vinay. What does destiny have in store for these young men? | 7.4 | 494 | 4000000 | 0 | 4791 | NaN | 1.0 | NaN |
| 85848 | tt9905462 | Pengalila | Pengalila | 2019 | 2019-03-08 | Drama | 111 | India | Malayalam | T.V. Chandran | T.V. Chandran | Benzy Productions | Lal, Akshara Kishor, Iniya, Narain, Renji Panicker, Indrans, Priyanka Nair | An unusual bond between a sixty year old Dalit worker Azhagan and an eight year old middle class girl Radha. Within no time their bond grows stronger. However, his proximity to Radha and her mother doesn't go down well with Radha's father. | 8.8 | 553 | 10000000 | 0 | 0 | NaN | NaN | NaN |
| 85849 | tt9906644 | Manoharam | Manoharam | 2019 | 2019-09-27 | Comedy, Drama | 122 | India | Malayalam | Anvar Sadik | None | chakkalakal Films | Vineeth Sreenivasan, Aparna Das, Basil Joseph, Indrans, Delhi Ganesh, Deepak Parambol, Hareesh Peradi, Nandu, Sreelakshmi, Ahamed Siddique, Nandini Sree, V.K. Prakash, Kalaranjini, Jude Anthany Joseph, Nisthar Sait | Manoharan is a poster artist struggling to find respect for his profession, after the advent of printing technology. He tries hard to get into the mainstream, by picking up design software skills. Will he succeed? | 6.8 | 491 | 0 | 0 | 0 | NaN | 9.0 | 1.0 |
| 85850 | tt9908390 | Le lion | Le lion | 2020 | 2020-01-29 | Comedy | 95 | France, Belgium | French | Ludovic Colbeau-Justin | Alexandre Coquelle, Matthieu Le Naour | Monkey Pack Films | Dany Boon, Philippe Katerine, Anne Serra, Samuel Jouy, Sophie Verbeeck, Carole Brana, Benoît Pétré, Aksel Ustun, Mathieu Lardot, Olivier Sa, Julien Prevost, Antoine Mathieu, David Ban, Stan, Guillaume Clémencin | A psychiatric hospital patient pretends to be crazy. In charge of caring for this patient, a caregiver will begin to doubt the mental state of his "protégé". | 5.3 | 398 | 0 | 0 | 3507171 | NaN | NaN | 4.0 |
| 85851 | tt9911196 | De Beentjes van Sint-Hildegard | De Beentjes van Sint-Hildegard | 2020 | 2020-02-13 | Comedy, Drama | 103 | Netherlands | German, Dutch | Johan Nijenhuis | Radek Bajgar, Herman Finkers | Johan Nijenhuis & Co | Herman Finkers, Johanna ter Steege, Leonie ter Braak, Stef Assen, Annie Beumers, Jos Brummelhuis, Reinier Bulder, Daphne Bunskoek, Karlijn Koel, Karlijn Lansink, Marieke Lustenhouwer, Jan Roerink, Ferdi Stofmeel, Aniek Stokkers, Belinda van der Stoep | A middle-aged veterinary surgeon believes his wife pampers him too much. In order to get away from her, he fakes the onset of dementia. | 7.7 | 724 | 0 | 0 | 7299062 | NaN | 6.0 | 4.0 |
| 85852 | tt9911774 | Padmavyuhathile Abhimanyu | Padmavyuhathile Abhimanyu | 2019 | 2019-03-08 | Drama | 130 | India | Malayalam | Vineesh Aaradya | Vineesh Aaradya, Vineesh Aaradya | RMCC Productions | Anoop Chandran, Indrans, Sona Nair, Simon Britto Rodrigues | None | 7.9 | 265 | 0 | 0 | 0 | NaN | NaN | NaN |
| 85853 | tt9914286 | Sokagin Çocuklari | Sokagin Çocuklari | 2019 | 2019-03-15 | Drama, Family | 98 | Turkey | Turkish | Ahmet Faik Akinci | Ahmet Faik Akinci, Kasim Uçkan | Gizem Ajans | Ahmet Faik Akinci, Belma Mamati, Metin Keçeci, Burhan Sirmabiyik, Orhan Aydin, Tevfik Yapici, Yusuf Eksi, Toygun Ates, Aziz Özuysal, Dilek Ölekli, Arcan Bunial, Seval Hislisoy, Ergül Çolakoglu, Gülçin Ugur, Ibrahim Balaban | None | 6.4 | 194 | 0 | 0 | 2833 | NaN | NaN | NaN |
| 85854 | tt9914942 | La vida sense la Sara Amat | La vida sense la Sara Amat | 2019 | 2020-02-05 | Drama | 74 | Spain | Catalan | Laura Jou | Coral Cruz, Pep Puig | La Xarxa de Comunicació Local | Maria Morera Colomer, Biel Rossell Pelfort, Isaac Alcayde, Lluís Altés, Joan Amargós, Pepo Blasco, Cesc Casanovas, Oriol Cervera, Pau Escobar, Jordi Figueras, Arés Fuster, Judit Martín, Martí Múrcia, Mariona Pagès, Francesca Piñón | Pep, a 13-year-old boy, is in love with a girl from his grandparents village, Sara Amat. One summer night Sara disappears without a trace. After a few hours, Pep finds her hiding in his room. | 6.7 | 102 | 0 | 0 | 59794 | NaN | NaN | 2.0 |